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DR01006 C TCGAG— TCTGGGGGAGG 

DR27006 C TCGAG— TCTGGGGGAGG 

DR03006 C AGGTGA AACTGCTCGAG — TCTGGAGGAGG 

DR11006 C TCGAG— TCTGGGGGAGG 

DR2W06 C AGGTGA AACTGCTCGAG— TCTGGGGGAGG 

DR16006 C TCGAG— TCTGGAGGAGG 

DR19006 C TCGAG— TCTGGAGGAGG 

DR07006 C TCGAG— TCTGGGGGAGG 

DR16006 C TCGAG— TCTGGGGGAGG 

DR20006 C TCGAG— TCAGGGGGAGG 

DR25006 C TCGAG— TCTGGGGGAGG 

DR20006 C TCGAG— TCTGGAGGAGG 

DR21006 C TCGAG— TCTGGGGGAGG 

DR09006 C AGGTGA AACTGCTCGAG— TCTGGGGGAGG 

DR17006 C TCGAG— TCTGGGGGAGG 

DR13006 C TCGAG— TCAGGGGGAGG 

DR02006 CTCGAGTCAGGTGTCCGGTCTGATGTGCAGCTGGTGGCGTCTGGGGGAGG 

DR01006 ATCGGTGCAGGCTGGAGGGTCTCTGAGACTCTC--GTGCG-CAGCCTCTG 

DR27006 CTCGGTGCAGGCTGGAGGGTCTCTGA6ACTCTCCTGTGCATCTTCTTCTA 

DR03006 CTCGGTGCAGACTGGAGGATCTCTGAGACTCTCCTGTGCAGT--C-TCTG 

DR11006 GTCGGTGCAGGCTGGAGGGTCTCTGAGACTCTCCTGTAATGT— C-TCTG 

DR24006 GTCGGTGCAGGCTGGAGGGTCTCTGAGACTCTCCTGTAATGT-C-TCTG 

DR16006 CTCGGCGCAGGCTGGAGGATCTCTGAGACTCTCCTGTGCAGC— CCACGG 

DR19006 CTCGGTTCAGGCTGGAGGGTCCCTTAGACTCTCCTGTGCAGC--C-TCTG 

DR07006 CTCGGTGCAGGGTGGAGGGTCTCTGAGACTCTCCTGTGCAA— TCTCTG 

DR16006 CTCGGTGCAGGCTGGAGGGTCTCTGAGACTCTCCTGTACAG — GCTCTG 

DR20006 CTCGGTACAGGTTGGAGGGTCTCTGAGACTCTCCTGTGTAG— CCTCTA 

DR25006 CTCGGTACAAACTGGAGGGTCTCTGAGACTCTCTTGCG — AAATCTCTG 

DR20006 CTCGGTGCAGGCTGGAGGGTCTCTGAGACTCTCCTGTG — TAGCCTCTG 

DR21006 CTCGGTGCAGGTTGGAGGGTCTCTGAAACTCTCCTGTAAAAT---CTCTG 

DR09006 CTCGGTGCAGGCTGGGGGGTCTCTGACACTCTCTTGTG---TATACAC-- 

DR17006 CTCGGTCCAACCTGGAGGATCTCTGACACTCTCCTGTACAGTT---TCTG 

DR13006 CTCGGTGGAGGCTGGAGGGTCTCTGAGACTCTCCTGTACAG — CCTCTG 

DR02006 CTCGGTGCAGGCTGGAGGCTCTCTGAGACTCTCCTGTACAG — CCTCTG 

DR01006 GA--TACAGTAATT-— GTCCCCTCACTTG-GAGCTGGTATCGCCAGTTT 

DR27006 AA— TATATGCCTT— GCACCTACGACAT-GACCTGGTACCGCCAGGCT 

DR03006 GA-TTCTCCTTTA— GTACCAGTTGTAT-GGCCTGGTTCCGCCAGGCT 

DR11006 GC— TCTCCCAGTA— GTACTTATTGCCT-GGGCTGGTTCCGCCAGGCT 

DR24006 GC— TCTCCCAGTA — GTACTTATTGCCT-GGGCTGGTTCCGCCAGGCT 

DR16006 GA— TTCCGC-TCA — ATGGTTACTACAT-CGCCTGGTTCCGTCAGGCT 

DR19006 AC— TACACCATCA— -CTGATTATTGCAT-GGCCTGGTTCCGCCAGGCT 

DR07006 GA— TACACGTACG— GTAGCTTCTGTAT-GGGCTGGTTCCGCGAGGGT 

DR16006 GA — TTCCCCTATA — GTACCTTCTGTCT-GGGGTGGTTCC6CCA6GCT 

DR20006 CT— CACACCGACA — GTAGCACCTGTAT-AGGCTGGTTCCGCCAGGCT 

DR25006 GA — TTGACTTTTG — ATGATTCTGACGT-GGGGTGGTACCGCCAGGCT 

DR20006 GA— TTCAATTTCG— AAACTTCTCGTAT-GGCGTGGTACCGCCAGACT 

DR21006 GAGGTACCCCAGATCGTGTTCCTAAATCTTTGGCCTGGTTCCGCCAGGCT 

DR09006 CAACGATACTGGGACCA TGGGATGGTTTCGCCAGGCT 

DR17006 --GGGCCACCTACA — GTGACTACAGTATTG6A-TGGATCCGCCAGGCT 

DR13006 G ATACGTAT-CCT CTATGGCCTGGTTCCGCCAGGTT 

DR02006 GAGA CAGTTTCAGTAGATT— TGCCATGTCTTGGTTCCGCCAGGCT 

FIG. 7A 
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CCAGGAACGGAGCGCGAGTTCGTCTCCAGTTO 

ccaggcaaggagcgcgaatttgtctcaagta™^ 

TCAGGAAAGCAGCGTGAGGGGGTCGCAGCCAnAATAGTGGCGGTG^ 

CCAGGGAGGGAGCGTGAGGGGGTCACAGCGATTAA CACTGATGG 

CCAGGGAAGGA6CGTGAGGGGGTCACAGCGAHAA------W 

CCTGGGAAGGGGCGTGAGGGGGTXGCAACM^ 
CCAGGGAAGGAGCGTGAATTGGTCGCAGCGAnCAAGTTGTCCGT^ 
CCAGGCAAGGAACGTGAGGGGATCGCAACTATO 
CCAGGGAAGGAGCGTGAGGGGGTCGCGGGTATTAATAGTGCAGGAOT 
CCAGGGAAGGAGCGCGAGGGGGTCGCAAGTATATATTTTGGTGATO^ 
CCAGGGCATGAGTGCAAATTGGTCTCAGGTATTCTGAGTGATGGTACT-C 



CCAGGAAATGTGTGTGAGTTGG' 
CCAGAGAAGGAGCGCGAGGGGA 
CCAGGGAAAGAGTGCGAAAGGG 



CCAGGGAAGGACCGTGAAGTAGTCGCAGCCGCTAATACTGGTG 



ccagggcaggagcgcgaggggg; 



CTCAAGTATTTACAGTGATGG- 
CGCAGTTCTTTCGACTAAGGATGGTAA 
CGCGCATATTACGCCTGATGGTATGA- 



CGCGTTTGTTCAAACGG- 



CCAGGGAAGGAGTGCGAATTGGTCTCAAGCATTCAAAGTAATGGAAGGAC 

CAAGTACA CATACTCCGTGAAGGGCCGCTTCACC 

AACATACG CAGACTCCGTGAAGGGCCGATTCACC 

GACATACTA-CAACACATATGTCGCCGAGTCCGTGAAGGGCCGAnW 

CAGTATCAT-ATACGCA GCCGACTCCGTGAAGGGCCGATTC A CC 

CAGTGTCAT-ATACGCA GCCGACTCCGTGAAGGGCCGATTCACC 

CGA-CGTCACATACTACGCCGACTCCGTGACGGGCCGATTTACC 

TACT--CGC-C-TCACAGACTACGCCGACTCCGTGAAGGGACGATTCACC 

CACATACTATGCCGACTCGGTGAAGGGCCGATTCACC 

TACTTACTATGCCGACGCCGTGAAGGGCCGATTCACC 

TACGAATTATCGCGACTCCGTGAAGGGCCGATTCACC 

CATATACAAAGAGTGGAGACTATGCTGAGTCTGTGAGGGGCCGGGTTACC 

CA-AAACATACTACGTCGACC--GCA ™AGGGCCGATTCACC 

GA CATTCTATGCCGACTCCGTGAAGGGCCGATTCACC 

CCTTCATTGATGAACCCGTGAAGGGGCGATTCACG 

CGACTAGTAAATTCTACGTCGACnTGTGAAGGGCCGATTCACC 

--CTGACAAT-AGTGCATTATATGGCGACTCCGTGAAGGGCCGAnCACC 
AACTGA GGCCGATTCCGTGCAAGGCCGATTCACC 

ATGTCCCGAGGCAGCACCGAGTACACAGTATTTCTGCAAATGGACAATCT 

ATCTCCCAAGACAGCGCCAAGAACACGGTGTATCTGCAGATGAACAGCCT 

ATCTCCCAAGACAACGCCAAGACCACGGTATATCnGATATGAACAACCT 

ATCTCCCAAGACACCGCCAAGGAAACGGTACATCTCCAGATGAACA^ 

ATCTCCCAAGACACCGCCAAGAAAACGGTATATCTCCAGATGAACAACCT 

ATCTCCCGAGACAGCCCCAAGAATACGGTGTATCTGCAGATGAACA^ 

ATCTCCCAAGGCAACACCAAGAACACAGTGAATCTGCAAATGAACAGCCT 

ATCTCCCAAGACAGCACGTTGAAGACGATGTATCTGCTAATGAACAA^ 

ATCTCCCAAGGGAATGCCAAGAATACGGTGTTTCTGCAAATGGATAACTT 

ATCTCCCAACTCAACGCCCAGAACACAGTGTATCTGCAAATGAACAGCCT 

ATCTCCAGAGACAACGCCAAGAACATGATATACCTTCAAATGAACGACCT 

AnTCTAGAGAGAATGCCAAGAATACATTGTATCTACAACTGAGCGGCa 

ATCTTCnAGATAATGACAAGACCACTTTCTCCTTACAACTTGATCGACT 

ATCTCCCGAGACAACGCCCAGAAAACGTTGTCnTGCGAATGAATAGTCT 

ATTTCCCAAGACAACGCCAAGAATACGGTATATCTGCAAATGAGCTTCCT 

ATCTCCCACGACAACGCCAAGAACACGCTGTATCTGCAAATGCGCAACCT 

ATCTCCCGAGACAATTCCAGGAACACAGTGTATCTGCAAATGAACAGCCT 

FIG. 7B 
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DR01006 GAAACCTGAGGACACGGCGATGTATTACTGTAAAAC-A— GCCCTAC-- 

DR27006 GAAACCTGAGGACACGGCGATGTATTACTGTAAAAT-A — GA--TTC-- 

DR03006 AACCCCTGAAGACACGGCTACGTATTACTGTGCGGCGG— TCCCAGCCC 

DR11006 GCAACCTGAGGATACGGCCACCTATTACTGCGCGGCAA — GACTGACGG 

DR24006 GCAACCTGAGGATACGGCCACCTATTACTGCGCGGCAA — GACTGACGG 

DR16006 GAAACCTGAGGACACGGCCATCTACTTCTGT6CAGCAG — G CTC 

DR19006 GACACCTGAGGACACGGCCATCTACAGTTGTGCGGCAA — C CAG 

DR07006 GAAACCTGAAGACACGGGCACCTATTACTGTGCTG-CA— GAACTAAGT 

DR16006 GAAACCTGAGGACACGGCCATCTATTACTGCGCGG-CG — GATAGTCCA 

DR20006 GAAACCTGAGGACAGCGCCATGTACTACTGTGCAATCA — CTGAAATTG 

DR25006 GAAACCTGAGGACACGGCCATGTATTACTGCGCGGTAGATGGTTGGACCC 

DR20006 CAAACCTGAGGACACGGCCATGTATTACTGTGCG CC 

DR21006 GAACCCGGAGGACACTGCCGACTACTACTGCGCTGCAAATCAATTAGC-- 

DR09006 GAGGCCTGAGGACACGGCCGTGTATTACTGTGCGGCAGATTG 

DR17006 GAAACCTGAGGACACGGCCATCTATTACTGTGCGGCAG CGGACCC 

DR13006 GCAACCTGACGACACTGGCGTGTACTACTGTGCGGCC CAA 

DR02006 GAAACCCGAGGACACGGCCGTGTATTACTGTGGGGCAGT 

DR01006 A-AC-- CTGGGGGTTATTGTGGGTA- 

DR27006 GTAC— CCGTGCCATCTCCTTGATG- 

DR03006 ACTTGGGACCT GGCG-CCATT CTT6ATTTG 

DR11006 AGATGGGGGCTTGTGATGCGAGATGG6CGACCTTAGC-GACAAGGAC-G 

DR24006 AGATGGGGGCTTGTGATGCGAGATGGGCGACCTTAGC— GACAAGGAC-G 

DR16006 GCGTTTTT-CTAGTCCTGTTGGGAGCACTTC-TAGAC — TCGAAAGTAG 

DR19006 TAGTTTTTA CTGGTA CT GCAC C — ACG G 

DR07006 GGTGGTAGTTGTGAATTGC — CTTTGC TATTTGACTA 

DR16006 TGTTACATGCCGACTATGC — CC6CTCCCCCGATACGAGACAGTTTTGG 

DR20006 AGTGGTATGGGTGCAATTT — AAGGACTACTTTTACT — C G 

DR25006 GGAAGGAAG— GGGGAATCGGGTTAC CCTGGTCGGTCCAATGTGAA 

DR20006 GGTTGAA TATC CTATTGCAGAC— ATGTGTT 

DR21006 — TGGTGGCTGGTATT TGGACCCGAATTACTGG-CTCTCTGTG 

DR09006 — GAAATACTGGA CTTGTGGTGC--CCAGA-CTGG AG 

DR17006 AAGTATATATTATAGTATC CTCCNNAT 

DR13006 AAGAAGGATCGTA CTAGATGGGC CGAGCCT 

DR02006 CTCCCTAA— TGGACCGAATTTC 

DR01006 -TGGGTANTGCCTCTGGGGCCAGGGGACCCAGGTCACCGTCTCCTCACT 

DR27006 -T CTGGGGCCAGGGGACCCAGGTCACCGTCTCCTCACT 

DR03006 AAAAAGTATAAGTACTGGGGCCAGGGGACCCAGGTCACCGTCTCCTCACT 

DR11006 TTTGCGTATAACTACTGGGGCCGGGGGACCCAGGTCACCGTCTCCTCACT 

DR24006 TTTGCGTATAACTACTGGGGCCGGGGGACCCAGGTCACCGTCTCCTCACT 

DR16006 CGA-CT-ATAACTATTGGGGCCAGGGGATCCAGGTCACCGTCACCTCACT 

DR19006 CGC-CTTATAACGTCTGGGGTCAGGGGACCCAGGTCACCGTCTCCTCACT 

DR07006 CTGGG GCCAGGGCACCCAGGTCACCGTCTCCTCACT 

DR16006 CTGGGATGATTTT GGCCAGGGGACCCAGGTCACCGTCTCCTCACT 

DR20006 CTGGG GCCAGGGGACCCAGGTCACCGTCTCCTCACT 

DR25006 GATGGTTATAACTATTGGGGCCAGGGGACCCAGGTCACCGTCTCCTCAC- 

DR20006 CGAGAT ACG — GCGACCCGGGGACCCAGGTCACCGTCTCCTCAC- 

DR21006 GGTGCATATGCCATCTGGGGCCAGGGGACCCAGGTCACCGTCTCCTCAC- 

DR09006 GATACTTCGGACAG-TGGGGTCAGGGGGCCCAGGTCACCGTCTCCTCACT 

DR17006 — TGAGTATAAGTACTGGGGCCAGGGGACCCAGGTCACCGTCTCCTCA-- 

DR13006 CGAGAATGGAACAACTGGGGCCAGGGGACCCAGGTCACCGTCTCCTCA- 

DR02006 CCAACATGGG— TGCCGGGGCCA6G6AACCCAGGTCACCGTCTCCT 



FIG. 7C 
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DR01006 AG TTACCC6TACGACGTTCCGGACTACGGTTCTTAATA6AATTC 

DR27006 AG TTACCCGTACGAGCTTCCGGACTACGGTTCTTAATAGAATTC 

DR03006 AGCTAGTTACCCGTACGACGTTCCGGACTACGGTTCTTAATAGAATTC 

DR11006 AG TTACCCGTACGACGTTCCGGACTACGGTTCTTAATAGAATTC 

DR2W06 AGCTAGTTACCCGTACGACGTTCCGGACTACGGTTCTTAATAGAATTC 

DR16006 AGTTACCCGTACGACGTTCCGGACTACGGTTCTTAATAGAATTC 

DR19006 AGTTACCCGTACGACGTTCCGGACTACGGTTCTTAATAGAATTC 

DR07006 AGTTACCCGTACGACGTTCCGGACTACGGTTCTTAATAGAATTC 

DR16006 AGTTACCCGTACGACGTTCCGGACTACGGTTCTTAATAGAATTC 

DR20006 AGTTACCCGTACGACGTTCCGGACTACGGTTCTTAATAGAATTC 

DR25006 — TAGTTACCCGTACGACGTTCCGGACTACGGTTCTTAATAGAATTC 

DR20006 — TAGTTACCCGTACGACGAACCGGACTACGGTTCTTAATAGAATTC 

DR21006 — TAGTTACCCGTACGACGTTCCGGACTACGGTTCTTAATAGAATTC 

DR09006 AGCTAGTTACCCGTACGACGTTCCGGACTACGGTTCTTAATAGAATTC 

DR17006 

DR13006 

DR02006 TA 



FIG. 7D 
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IMMUNOGLOBULINS DEVOID OF LIGHT Ward et al have described in their publication that V„ 

CHAINS domains are relatively sticky, presumably due to the exposed 
hydrophobic surface normally capped by the or 

This is a division of application Ser. No. 08/106,9-44, domains. They consequently envisage that it should be 

filed Aug. 17, 1993 now abandoned. 5 possible to design V H domains having improved properties 

The invention relates to new isolated immunoglobulins an( j further that V H domains with binding activities could 

which are devoid of light polypeptide chains. These immu- serve as fce building blocks for making variable fragments 

noglobulins do not consist in the degradation products of ^ fragments) or complete antibodies, 

immunoglobulins composed of both heavy polypeptide and ^ ^^,1 ^ not start from the idea that the 

light polypeptide chains but to the contrary, toe invention 1Q fr nts (hght and heavy chains) and the different 

defines a new member of the family of the ^„^ s of mese frag ^ nts of four-chain modd immuno- 

immunogtobulins, especially a new _ typeof molecules ^ new or an%en 

capable of being involved in the immune recognition. Such .... „ . . . , ... 

inmiunoglobulifs can be used for several purposes, espe- b ^ sltes or a fow-cbzm model mimunoglobuhn. 

cially for diagnosis or therapeutical purposes including ™ e ™ e «°5 s have determined that mununoglobuhns 

protection against pathological agents or regulation of the 15 <*n have a different structure than the known four-chain 

expression Ir activity of proteins. model and that such different immunoglobulins offer new 

Up to now the structure proposed for immunoglobulins means for the preparation of diagnosis reagents, therapeu- 

consists of a four-chain model referring to the presence of tical agents or any other reagent for use in research or 

two identical light polypeptide chains (light chains) and two industrial purposes. 

identical heavy polypeptide chains (heavy chains) linked 20 Thus the invention provides new immunoglobulins 

together by disulfide bonds to form a y- or T-shaped mac- which are capable of showing functional properties of 

romolecules. These chains are composed of a constant four-chain model immunoglobulins although their structure 

region and a variable region, the constant region being appears to be more appropriate in many circumstances for 

subdivided in several domains. The two heavy polypeptide their use, their preparation and in some cases for their 

chains are usually linked by disulpbide bounds in a so-called 25 modification. Moreover these molecules can be considered 

"hinge region" situated between the first and second as lead structures for the modification of other immunoglo- 

domains of the constant region. bulins. The advantages which are provided by these immu- 

Among the proteins forming the class of the noglobulins comprise the possibility to prepare them withan 

immunoglobulins , most of them are antibodies and accord- increased facility. 

ingly present an antigen binding site or several antigen 30 The invention accordingly relates to immunoglobulins 

binding sites. characterized in that they comprise two heavy polypeptide 

According to the four-chain model, the antigen binding chains sufficient for the formation of a complete antigen 

site of an antibody is located in the variable domains of each binding site or several antigen binding sites, these immu- 

of the heavy and tight chains, and requires the association of noglobulins being further devoid of tight polypeptide chains, 

the heavy and the light chains variable domains. 35 In a particular embodiment of the invention, these immu- 

For the definition of these four-chain model nogfcbulmsarefurmerdiaracter^^ 

immunoglobulins, reference is made to Roitt I et al the product of the expression in a prokaryotic or in a 

(Immunology-second-Edition Gower Medical Publishing eukaryotic host cell, of a DNA or of a cDNA having the 

USA, 1989). Reference is especially made to the part sequence of an immunoglobulin devoid of light chains as 

concerning the definition of the four-chain 40 obtainable from lymphocytes or other cells of Camelids. 

irnmunoglobulins, their polypepudic and genetic structures, The immunoglobulins of the invention can be obtained 

the definition of their variable and constant regions and the for example from the sequences which are described in FIG. 

obtention of the fragments produced by enzymatic degra- 7. 

dation according to well known techniques. The immunoglobulins of the invention, which are devoid 

The inventors have surprisingly established that different 45 of light chains are such that the variable domains of their 

molecules can be isolated from animals which naturally heavy chains have properties differing from those of the 

produce them, which molecules have functional properties four-chain immunoglobulin V^. The variable domain of a 

of immunoglobulins these functions being in some cases heavy -chain immunoglobulin of the invention has no normal 

related to structural elements which are distinct from those interaction sites with the V t or with the C H l domain which 

involved in the function of four-chain immunoglobulins due 50 do not exist in the heavy chain immunoglobulins, it is hence 

for instance to the absence of light chains. a novel fragment in many of its properties such as solubility 

The invention relates to two-chain model immunoglo- and position of the binding site. For clarity reasons we will 

bulins which neither correspond to fragments obtained for call it V ffl in this text to distinguish it from the classical W H 

instance by the degradation in particular the enzymatic of four-chain immunoglobulins. 

degradation of a natural four-chain model immunoglobulin, 55 By "a complete antigen binding site" it is meant accord- 
nor correspond to the expression in host cells, of DNA ing to the invention, a site which will alone allow the 

coding for the constant or the variable region of a natural recognition and complete binding of an antigen. This could 

four-chain model immunoglobulin or a part of these regions, be verified by any known method regarding the testing of the 

nor correspond to antibodies produced in lymphopaties for binding affinity. 

example in mice, rats or human. 60 These immunoglobulins which can be prepared by the 

E. S. Ward et al (1) nave described some experiments technique of recombinant DNA, or isolated from animals, 

performed on variable domains of heavy polypeptide chains will be sometimes called "heavy-chain immunoglobulins" in 

(y H ) or/and light polypeptide chains (Vj/Fy) to test the the following pages. In a preferred embodiment of the 

ability of these variable domains, to bind specific antigens. invention, these immunoglobulins are in a pure form. 
For this purpose, a library of \ H genes was prepared from 65 In a first embodiment, the immunoglobulins of the inven- 

the spleen genomic DNA of mice previously immunized tion are obtainable in prokaryotic cells, especially in E. coli 

with these specific antigens. cells by a process comprising the steps of: 
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a) cloning in a Bluecript vector of a DNA or cDNA A particular Immunoglobulin is characterized in that it has 
sequence coding for the V WJ/ domain of an immune- a catalytic activity, especially in that it is directed against an 
globulin devoid of light chain obtainable for instance antigen mimicking an activiated state of a given substrate, 
from lymphocytes of Camelids, These catalytic antibodies can be modified at the level of 

b) recovering the cloned fragment after amplification 5 their biding site, by random or directed mutagenesis in order 
using a 5' primer containing an Xho site and a 3' primer t o increase oe modify their catalytic function. Reference 
containing the Spe site having the following sequence j^y be made to the publication of Lerner et al (TIBS 

TC TTA ACT ACT GAG GAG ACG GTG ACC TG, SEQ ID ^ <* «** technique for the 

NO:51 v preparation of such catalytic immunoglobulins. 

10 According to a preferred embodiment, the immunoglo- 

c) cloning the recovered fragment in phase in the immuno bulins of the invention are characterized in that their variable 
PBS vector after digestion of the vector with Xho and regions contain in position 45, an amino-acid which is 
Spe restriction enzymes, different from leucine, proline or glutamine residue. 

d) transforming host cells, especially E. coli by transfec- Moreover the heavy-chain immunoglobulins are not prod- 
tion with the recombinant immuno PBS vector of step 15 ucts characteristic of lymphocytes of animals nor from 
c, lymphocytes of a human patient suffering from lymphop- 

e) recovering the expression product of the V HH coding amies. Such immunoglobulins produced in lymphopathies 
sequence, for instance by using antibodies raised are monoclonal in origin and result from pathogenic muta- 
against the dromadary domain. tions at the genomic level. They have apparently no antigen 

In another embodiment the immunoglobulins are hetero- 20 binding site, 

specific immunoglobulins obtainable by a process compris- The two heavy polypeptide chains of these immunoglo- 

ing the steps of: bulins can be linked by a hinge region according to the 

obtaining a first DNA or cDNA sequence coding for a definition of Roitt et al. 

Vhh domain or part thereof having a determined sped- in a particular embodiment of the invention, immunoglo- 

ficity against a given antigen and comprised between 25 bulins corresponding to the above-defined molecules are 

Xho and Spe sites, capable of acting as antibodies. 

obtaining a second DNA or cDNA sequence coding for a The antigen binding site(s) of the immunoglobulins of the 

V m domain or part thereof, having a determined invention are located in the variable region of the heavy 

specificity different from the specificity of the first chailL 

DNA or cDNA sequence and comprised between the w to , ffaap of mese ^^^0^ each 



digesting an immuno PBS vector with EcoRI and Xhol 

restriction enzymes, 
li gating the obtained DNA or cDNA sequences coding for 



heavy polypeptide chain contains one antigen binding site 
on its variable region, and these sites correspond to the sa 



In a further embodiment of the invention the iiuiuuuu- 

V„„ domains, so that the DNA or cDNA sequences are ,< ..,,■„ .„„„... ■ , ■ . , „„, : j„ 

serkllv cloned in the vector. 35 m charactenzed m that their heavy polypeptide 



serially cloned in the vector, 
transforming a host cell, especially E. coli cell by 
transection, and recovering the obtained immunoglo- 
bulins. 

In another embodiment, the immunoglobulins are obtain- 
able by a process comprising the steps of: 



chains contain a variable region (V ww ) and a constant legion 
(C„) according to the definition of Roitt et al, but are devoid 
of the first domain of their constant region. This first domain 
of the constant region is called C„l. 
3 These inununoglobulins having no C„l domain are such 

'"-J'. " 1 ""7"**^""' * „ that the variable region of their chains is directly linked to 

obtaimng a DNA or cDNA sequence coding for a K ^ Qn ^ £ c ^ ^ me y^^on. 

of the type described hereabove 



r part thereof, having a determined specific 
antigen binding site, 

amplifymgtrieobtamedDNAOTcDNAusmgaS'prmier _ 

containing an initiation codon and a Hindm site, and a class 2 0gG2) or unmunoglobulins of class 3~(IgG3). 

3' primer containing a termination codon having a Xhol The absence of me light cf ' ' 

site ' domain lead to a modification of the nomenclature of the 

recombining the amplified DNA or cDNA into the Hindm immunoglobulin fragments obtained by enzymatic 

(position 2650) and Xhol (position 4067) sites of a x digestion, according to Roitt et al. 

plasmid pMM984, The terms Fc and pFc on the one hand, Fc 1 and pFc' on the 

transfecting permissive cells especially NB-E cells with other hand corresponding respectively to the papain and 

the rec ombin a nt plasmid, pepsin digestion fragments are maintained, 

recovering the obtained products. The terms Fab F(ab) 2 F(ab') 2 Fabc, Fd and Fv are no 

Successful expression can be verified with antibodies 55 longer applicable in their original sense as these fragments 

directed against a region of a domain, especially by an have either a light chain, the variable part of the light chain 

ELBA assay. or the C„l domain. 

According to another particular embodiment of mis The fragments obtained by papain digestion and corn- 
process, the immunoglobulins are cloned in a parvovirus. posed of the ~V HH domain and the hinge region will be called 
In another example these immunoglobulins are obtainable 60 FV HH h or F(V^h) 2 depending upon whether or not they 
by a process comprising the further cloning of a second remain linked by the disulphide bonds. 
DNA or cDNA sequence having another determined antigen In another embodiment of the invention, immunoglobu- 
binding site, in the pMM984 plasmid. tins replying to the hereabove given definitions can be 
Such an Immunoglobulin can be further characterized in originating from animals especially from animals of the 
that it is obtainable by a process wherein the vector is Yep 65 cametid family. The inventors have found out that the 
52 and the transformed recombinant cell is a yeast especially heavy-chain immunoglobulins which are present in cam- 
S- cerevisiae. elids are not associated with a pathological situation which 
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would induce the production of abnormal antibodies with has a molecular weight of around 100 kd after elution with 

respect to the four-chain immunoglobulins. On the basis of a pH 3 5 buffer (0 .15M NaCl, 0.58% acetic acid), 
a comparative study of old world camelids {Camelus bac- 

trianus and Camelus dwmaderius) and new world camelids is adsorbed by chromatography on a Protein G Sepharose 

(for example Lama Paccos, Lama Glama, and Lama 5 column and eluted with pH 3.5 buffer (0.15M NaCl, 

Vicuqna) the inventors have shown mat the immunoglobu- „ 5g „ acetic acid 1 * 

lins of the invention, which are devoid of light polypeptide ' ° '' 

chains are found in all species. Nevertheless differences may consists of heavy y3 polypeptide chains of a molecular 

o^STonVT^ wei g htofaround45Kdinparticularbetween43and47 

of a heavy chain contained in these immunoglobulins can be 10 kd after reduction. 

from approximately 43 kd to approximately 47 kd, in jhe immunoglobulins of the invention which are devoid 

particular 45 kd. of Ught chains, nevermdess comprise on meir heavy chains 

Advantageously the heavy-chain immunoglobulins of the ^ ^ ' 

invention are secreted in blood of camelids. a constant region and a variable region. The constant region 

Immunoglobulins according to this particular embodi- 13 comprises different domains, 

ment of the invention are obtainable by purification from ^ • n rf fc^^,^ of ^ iwfsuSion 

serum of camelids and a process for the purification is ik»«uub«8i«»»™» b 1 

described in details in the examples. In the case where the comprises frameworks (FW) and complementarity deter- 

immiinoglobulins are obtained from Camelids, the invention mining regions (CDR), especially 4 frameworks and 3 

relates to immunoglobulins which are not in their natural 20 complementarity regions. It is distinguished from the four- 
biological environment 



According to the invention immunoglobulin IgG2 as 



chain immunoglobulins especially by the fact that this 



obtainable by purification from the serum of camelids can be variable region can itself contain an antigen binding site or 

characterized in mat: several, without contribution of the variable region of a light 
it is not adsorbed by chromatography on Protein G 2 s chain which is absent. 

Sepharose column, 
it is adsorbed by chromatography on Protein A Sepharose 

column, prise among others respectively amino-acid sequences 

it has a molecular weight of around 100 kd after elution which can be selected from the following: 
with a pH 4.5 buffer (0.15M NaCl, 0.58% acetic acid ^ , , J . 

adjusted to pH 4.5 by NaOH), for framework 1 domain 



GGSVQTGGSLRL 
GGSVQTGGSLRL 
GGSEQGGGSLRL 
GGSVQPGGSLTL 
GGSVQAGGSLRL 
GGSVQAGGSLRL 
GGSVQAGGSLRL 



SCE I SGLTFD SEQ ID NO: 1 

SCAVSGFSFS SBQIDNO:2 

SCA I SGYTYG SEQIDNO:3 

SCTVSGATYS SEQIDNO:4 

SCTGSGFPYS SEQIDNO:5 

SCVAGFGTS SEQIDNO:6 

SCVSFSPSS SEQIDNO:7 



it consists of heavy y2 polypeptide chains of a molecular for 
weight of around 46 kd preferably 45 after reduction. 

According to a further embodiment of the invention w gqgtqvtvs s seqidno:8 

another group of immunoglobulins corresponding to IgG3, w gqgtlvtvss seq id no£ 

^< A ^X^y^ C ^o a fro^^oi CaB ^is w GQGTQVTAS S Sn>5£i? 

characterized in that me immunoglobulin: R gqgtqvtvsl seq id NO:i2 

is adsorbed by chromatography on a Protein A Sepharose ^ " —— ~ " ~~" ~~ — 



ALQPGGYCGYGX C L SEQ ID NO:62 

V S L M D R I S QH G C SEQ ID NO:63 

VPAHLGPGAILDLKKY — — — — — — KY SEQIDNO:«4 

F CY S TAGDGG SGE— — — — — MY SEQ ID NO:65 

ELSGGSCELPLLF — — — — — DY SEQ ID NO:66 

DWKYWTCGAQTGGYF — — — — — G Q SEQ ID NO:67 

RLTEMGACDARWATLATRTFAYNY SEQIDNO:68 

Q K K D R T RWAE P REW — — — — — — NN SEQ ID NO:69 

GSRFSSPVGSTSRLES — SDY — — NY SEQ ID NO:70 

ADP S I YYS ILXIEY — — — — — — KY SEQ ID NO:71 

D S P CYMPTMPAPP I RDSFGW — — DD SEQ ID NO:72 

T S S FYWYCTTAPY — — — — — — — N V SEQ ID NO:73 

TE I EWYGCNLRTTF — — — — — — — — TR SEQ ID NO:74 

NQLAGGWYLD P NYWL SVGAY — — A I SEQ ID NO:75 

RLTEMGACDARWATLATRTFAYNY SEQ ID NO:76 

DGWTRKEGGIGLPWSVQCEDGYNY SEQ ID NO:77 
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As stated above, <he immunoglobulins of the invention 
are preferably devoid of the totality of their C w l domain. 

Such immunoglobulins comprise C^2 and C H 3 domains 
in the C-terminal region with respect to the hinge region, j 

According to a particular einbodiment of the invention the 
constant region of the immunoglobulins comprises C„2 ai 
CjjS domains comprising an 
from the following: 

for the C H 2 domain: 



APEIXGGPTVHFPPKPKDVLSnijrP SEQ ID NO:31 

AfELPGGPSVFVTPTCPKDVLSISGRP SEQ ID NO:32 

APELPGQPSVFVFPPKPKDVLSISaRP SEQ ID NO:33 

APELLOGPSVFIFPPKPKDVLSISGRP SEQ ID NO:34 



for the Cjj3 domain: 



GQTREPQVYTLA 
GQTREPQVYTLAPXRLEL 
GQPREPQVYTLPPSRDEL 
GQPREPQVYTLPPSREEM 



SEQ ID NO:35 
SEQIDNO:36 
SEQ ID NO: 109 
SEQIDNO:110 
SEQIDNO:lll 



Interestingly the inventors have shown that the hinge 3 
region of the immunoglobulins of the invention can present 
variable lengths. When these immunoglobulins act as 
antibodies, the length of the hinge region will participate to 
me determination of the distance separating the antigen 
binding sites. 3 

Preferably an immunoglobulin according to the invention 
is characterized in that its hinge region comprises from 0 to 



Particular sequences of hinge region of the immunoglo- 
bulins of the invention are the following. 4 



CTNEVCKCPKCP 



EPKIPQPQPKPQPQPQPQPKPQPKPEPECrCPKCP SEQ ID NO:38 

The short hinge region corresponds to an IgG3 molecule so 
and the long hinge sequence corresponds to an IgG2 mol- 
ecule. 

Isolated V w derived from heavy chain immunoglobulins 
or libraries corresponding to the heavy chain immu- 
noglobulins can be distinguished from V ffl cloning of 55 
four-chain model immunoglobulins on the basis of sequence 
features characterizing heavy chain immunoglobulins. 

The camel heavy-chain immunoglobulin V„„ region 
shows a number of differences with the regions derived 
from 4-chain immunoglobulins from all species examined. 60 
At the levels of the residues involved in the V^^/V^ 
interactions, an important difference is noted at the level of 
position 45 (FW) which is practically always leucine in the 
4-chain immunoglobulins (98%), the other amino acids at 
this position being proline (1%) or glutamine (1%). 65 

In the camel heavy-chain immunoglobulin, in the 
sequences examined at present, leucine at position 45 is only 



found once. It could originate from a four-chain immuno- 
globulin. In the other cases, it is replaced by arginine, 
cysteine or glutamic acid residue. The presence of charged 

j amino acids at this position should contribute to making the 
W HH more soluble. 

The replacement by camelid specific residues such as 
those of position 45 appears to be interesting for the con- 
struction of engineered regions derived from the V HH 

s repertoire of 4-chain immunoglobulins. 

A second feature specific of the camelid V ww domain is 
the frequent presence of a cysteine in the CDR 3 region 
associated with a cysteine in the CDR t position 31 or 33 or 
FW 2 region at position 45. The possibility of establishing a 

] disulphide bond between the CDR 3 region and the rest of the 
variable domain would contribute to the stability and posi- 
tioning of the binding site. 

With the exception of a single pathogenic myeloma 
protein (DAW) such a disulphide bond has never been 

5 encountered in immunoglobulin V regions derived from 4 
chain immunoglobulins. 

The heavy-chain immunoglobulins of the invention have 
further the particular advantage of being not sticky. Accord- 
ingly these immunoglobulins being present in the serum, 

3 aggregate much less than isolated heavy chains of a four- 
chain immunoglobulins. The immunoglobulins of the inven- 
tion are soluble to a concentration above 0.5 mg/ml, pref- 
erably above 1 mg/ml and more advantageously above 2 

j These immunoglobulins farmer bear an extensive antigen 
binding repertoire and undergo affinity and specificity matu- 
ration in vivo. Accordingly they allow the isolation and the 
preparation of antibodies having defined specificity, regard- 
ing determined antigens. 
3 Another interesting property of the immunoglobulins of 
the invention is mat they can be modified and especially 
humanized. Especially it is possible to replace all or part of 
the constant region of these immunoglobulins by all or part 
of a constant region of a human antibody. For example the 
5 C^L and/or 0^3 domains of the immunoglobulin could be 
replaced by the 0,2 and/or C^3 domains of the IgG ^3 
human immunoglobulin. 

In such humanized antibodies it is also possible to replace 
a part of the variable sequence, namely one or more of the 
3 framework residues which do not intervene in the binding 
site by human framework residues, or by a part of a human 



Conversely features (especially peptide fragments) of 
heavy-chain immunoglobulin regions, could be intro- 
duced into the V H or V L regions derived from four-chain 
immunoglobulins with for instance the aim of achieving 
greater solubility of the immunoglobulins. 

The invention further relates to a fragment of an immu- 
noglobulin which has been described hereabove and espe- 
cially to a fragment selected from the following group: 
a fragment corresponding to one heavy polypeptide chain 

of an immunoglobulin devoid of light chains, 
fragments obtained by enzymatic digestion of the immu- 
noglobulins of the invention, especially those obtained 
by partial digestion with papain leading to the Fc 
fragment (constant fragment) and leading to FV^^h 
fragment (containing the antigen binding sites of the 
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heavy chains) or its dimer F(V /ffl( h) 2 , or a fragment 
obtained by further digestion with papain of the Fc 
fragment, leading to the pFc fragment corresponding to 
the C-terminal part of the Fc fragment, 
homologous fragments obtained with other proteolytic 
enzymes, 

a fragment of at least 10 preferably 20 amino acids of the 
variable region of the immunoglobulin, or the complete 
variable region, especially a fragment corresponding to 
the isolated V HH domains or to the V HH dimers linked 
to the hinge disulphide, 

a fragment corresponding to the hinge region of the 
immunoglobulin, or to at least 6 amino acids of this 
hinge region, 

a fragment of the hinge region comprising a repeated 

sequence of Pro-X, 
a fragment corresponding to at least 10 preferably 20 
amino acids of the constant region or to the complete 
constant region of the immunoglobulin. 
The invention also relates to a fragment comprising a 
repeated sequence, Pro-X which repeated sequence contains 
at least 3 repeats of Pro-X, X being any amino-acid and 
preferably Gin (glutamine), Lys (lysine) or Glu (acide 
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Such a fragment can be advantageously used as a link 
between different types of molecules. 

The amino-acids of the Pro-X sequence are chosen among 
any natural or non natural amino-acids. 
5 The fragments can be obtained by enzymatic degradation 
of the immunoglobulins. They can also be obtained by 
expression in cells or organisms, of nucleotide sequence 
coding for the immunoglobulins, or they can be chemically 
synthesized. 

io The invention also relates to anti-idiotypes antibodies 
belonging to the heavy chain immunoglobulin classes. Such 
anti-idiotypes can be produced against human or animal 
idiotypes. A property of these anti-idiotypes is that they can 
be used as idiotypic vaccines, in particular for vaccination 
is against glycoproteins or glycolipids and where the carbo- 
hydrate determines the epitope. 

The invention also relates to anti-idiotypes capable of 
recognizing idiotypes of heavy-chain immunoglobulins. 
Such anti-idiotype antibodies can be either syngeneic 
20 antibodies or allogenic or xenogeneic antibodies. 

The invention also concerns nucleotide sequences coding 
for all or part of a protein which amino-acid sequence 
comprises a peptide sequence selected from the following: 



GGSVQTGGSLRLSCE I SGLTFD SEQIDNO:l 

GGSVQTGGSLRLSCAVSGFSFS SEQIDNO:2 

GGSEQGGGSLRLSCAI SGYTYG SEQIDNO:3 

GGSVQPGGSLTLSCTVSGATYS SEQIDNO:4 

GGSVQAGGSLRLSCTGSGFPYS SEQIDNO : 5 

GGSVQAGGSLRLSCVAGFGTS SEQlDNO:6 

GGSVQAGGSLRLSCVSFSPSS SEQIDNO:7 

WGQGTQVTVSS SEQIDNO* 

WGQGTLVTVSS SEQ ID NOS 

WGQGAQVTVSS SEQIDNO:10 

WGQGTQVTASS SEQ ID NO: 11 

RGQGTQVTVSL SEQIDNO:12 

ALQPGGYCGYGX — — — C L SEQ ID NO:62 

V S LMD R I S Q H — — — — — —— — — — — — G C SEQ ID NO:63 

VPAHLGPGA I LDLKKY — K Y SEQ ID NO:64 

FCYS TAGDGGSGE — — — — — — — — — MY SEQ ID NO:65 

EL SGGSCELPLLF — — — — — — — — — DY SEQ ID NO:66 

DWKYWTC GA QTGGY F— — — — — — — G Q SEQ ID NO:67 

RLTEMGACDARWATLATRTFAYNY SEQIDNO:68 

QKKDRTRWAEPREW — — — — — — — — NN SEQ D) NO:® 

GSRFSSPVGSTSRLES — SDY — — NY SEQIDNO:70 

ADPS I YYS I L X I BY — — — — — — — — KY SEQ ID NO:71 

DSPCYMPTMPAPP I RDSFOW — — DD SEQIDNO:72 

T S SFYWYCTTAPY — — — — — — — — — NV SEQ ID NO:73 

TE I EWYGCNLRTTF — — — — — — — — T R SEQ ID NO:74 

NQLAGGWYLDPNYWL S VGAY— — A I SEQ ID NO:75 
RLTEMGACDARWATLATRTFAYNY SEQ ID NO:76 
DGWTRKEGG IGLPWSVQCEDGYNY SEQ ID NO:77 

D S Y P C H L L — — — — — — — — — — — — — — D V SEQ ID NO:78 

VEYP I ADMC S RY SEQ ID NO:79 

APELLGOTSVFVFPPKPKDVLSISaXPK SEQ ID NO:39 

APELPGGPSVFVFPPKPKDVLSISGRPK SEQ ID NO:41 

GQTREPQVYTLAPXRLEL SEQ ID NO:36 

GQPREPQVYTLPPSRDEL SEQ ID NO: 109 

GQPREPQVYTLFPSREEM SEQ ID NO: 110 

GQPREPQVYTLPPSQEEM SEQ ID NO:lll 

or, 

VWSSEPWQPQPmjPQPQPQPKPQPKPEFTCTCPKCPAPEIiGGPSVFIFP SEQ ID NO:101 

GTNEVCKCPKCP SEQ ID NO:37 

APELPGGPSVFVFP SEQ ID NO:45 

EPK1PQPQPKPQPQPQPQPH>QPKPEPECTCPKCP SEQ ID NO:38 

APELLQGPSVHFP SEQ ID NO:46 



65 

glutamique); a particular repeated fragment is composed of Such nucleotide sequences can be deduced from the 
a 12-fold repeat of the sequence Pro-X. amino-acid sequences taking into account the deneneracy of 
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the genetic code. They can be synthesized or isolated from In a particular example, the following 3' primer in which 

cells producing immunoglobulins of the invention. a Kpnl site has been constructed and which corresponds to 

A procedure for die obtention of such DNA sequences is amino-acids 313 to 3 19 (CGC CAT CAA GGT AAC ACT 

described in the examples. TGA) SEQ ID NO:47 is used in conjunction with mouse 

The invention also contemplates RNA, especially nRNA s V ffl primers described by Sestry et al and containing a Xho 

sequences corresponding to these DNA sequences, and also site 
corresponding cDNA sequences. 

The nucleotide sequences of the invention can further be ~ ~ ~ ~~" ^ "~— ■ - - ^ — ■ 

used for the preparation of primes appropriate for the £ ™ «g go crc OAS ja GO gjDjjojj, 
detection in cells or screening of DNA or cDNA libraries to io ao gtc cag ctt ctc gag tct gg seqidno:5o 
isolate nucleotide sequences coding for immunoglobulins of 

the invention. ' TOoim* 



Such nucleotide sequences can be used for the preparation 

of recombinant vectors and the expression of these These primers yield a library of camelid heavy chain 

sequences contained in the vectors by host cells especially 15 immunoglobulins comprising the V m region (related to 

prokaryotic cells like bacteria or also eukaryotic cells and mouse or human subgroup IH), the hinge and a section of 

for example CHO cells, insect cells, simian cells like Vero CH 2 . 

cells, or any other mammalian cells. Especially the fact that in another example, the cDNA is polyadenylated at its 5' 

the immunoglobulins of the invention are devoid of tight end and the mouse specific V HH primers are replaced by a 

chains permits to secrete them in eukaryotic cells since there 20 poly T primer with an inbuilt Xhol site, at the level of 

is no need to have recourse to the step consisting in the nucleotide 12. 
formation of the BIP protein which is required in the 



The inadequacies of the kaown methods for producing ^ same 3. with a ^ ^ is used . 

monoclonal antibodies or immunoglobulins by recombinant 25 generates a j^raiy containing all subgroups 

DNA technology comes from the necessity in the vast Q f immmjoglobulins 

majority of cases to clone simultaneously the \ H and V L Part of me mteres ; in cloning a regioil encompassing the 

domams corresponding to the specific binding site of 4 chain hinge-CH 2 link is that in both Y2 and y3, a Sac site is present 

immunoglobulins. The animals and especially camelids immediately after the hinge. This site allows the grafting of 

which produce hcavy^chain immunoglobulins according to 30 mc sequence for me v ^ ^ onto the Fc 

the invention, and possibly other vertebrate species are K gon of other immunoglobulins, in particular the human 

"O^J* J H ? lucin S hcav y- d, f i11 immunoglobulins of j^j mi ^ which ^ mc samc amino acid sequence at 

which the binding site is located exclusively m the \ m ^ site (G1Um6 l^). m m example, the invention 

domain. Unlike the few heavy-chain immunoglobulins pro- contemplates a cDNA library composed of nucleotide 

Sr^^hiSZ, ^^1°LSL'S " codin S fOT ■ heavy^hain immunoglobulin, such 

cloning, the camelid heavy-chain immunoglobulins have ^ obtauied by performing the following steps: 

undergone extensive niaturatic-n in Wyo. Moreover their V a) ^ a le contaimlI ^ ^ oid ^ es ^ aJ1 

region has naturally evolved to function in absence of the ^ lymphocytes, spleenTlls, lymph noL or 

L ^ T t^J™/? monoclonal ^ fr ^ m a ^ ^nal, espe- 

antioodies by recomlnnant DNA technology. As the obten- 40 ^ mc m ^ t0 

Hon of specific antigen binding clones does not depend on a the lymphoid cells 

stochastic process necessitating a very large number of b) separating polyadenylated RNA from the other nucleic 

recombinant cells, this allows also a much more extensive ' fr~r ^i 9 ^'***?™"- " um m KHUa c 

examination of the repertoire. aads wd components of the cells, 

This can be done at the level of the non rearranged V m 45 c) ™e obtained RNA with a reverse transcriptase 

repertoire using DNA derived from an arbitrarily chosen m order to me corresponding cDNA, 

tissue or cell type or at the level of the rearranged d ) contacting the cDNA of step c) with 5' primers corre- 

repertoire, using DNA obtained from B lymphocytes. More sponding to mouse V„ domain of four-chain 

interesting however is to transcribe the mRNA from anti- immunoglobulins, which primer contains a determined 

body producing cells and to clone the cDNA with or without 50 restriction site, for example an Xhol site and with 3' 

prior amplification into an adequate vector. This will result primers corresponding to the N-terminal part of a C«2 

in the obtention of antibodies which have already undergone ^main containing a Kpnl site, 

affinity maturation. The examination of a large repertoire e ) amplifying the DNA, 

should prove to be particularly useful in the search for f) cloning the amplified sequence in a vector, especially in 

antibodies with catalytic activities. 55 a bluescript vector, 

The invention thus provides libraries which can be gen- g) recovering the clones hybridizing with a probe corre- 

erated in a way which includes part of the hinge sequence, sponding to the sequence coding for a constant domain 

the identification is simple as the hinge is directly attached from an isolated heavy-chain immunoglobulin, 

to the Vjjh domain. This cloning gives rise to clones containing DNA 

These libraries can be obtained by cloning cDNA from 60 sequences including the sequence coding for the hinge. It 

lymphoid cells with or without prior PCR amplification. The thus permits the characterization of the subclass of tee 

PCR primers are located in the promoter, leader or frame- immunoglobulin and the Sad site useful for grafting the 

work sequences of the for the 5' primer and in the FV^^h to the Fc region. 

hinge, C„2, C„3, 3' untranslated region or polyA tail for the The recovery of the sequences coding for the heavy-chain 

3' primer. A size selection of amplified material allows the 65 immunoglobulins can also be achieved by the selection of 

construction of a library limited to heavy chain immuno- clones containing DNA sequences having a size compatible 

globulins. with the lack of the C^l domain. 
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It is possible according to another embodiment of the 
invention, to add the following steps between steps c) and d) 
of the above process: 

in the presence of a DNA polymerase and of deoxyribo- 
nucleotide triphosphates, contacting said cDNA with 
oligonucleotide degenerated primers, which sequences 
are capable of coding for the hinge region and 
N-terminal \ HH domain of an immunoglobulin, the 
primers being capable of hybridizing with the cDNA 
and capable of initiating the extension of a DNA 
sequence complementary to the cDNA used as 



The invention also relates to a process for the preparation 
of a monoclonal antibody directed against a determined 
antigen, the antigen binding site of the antibody consisting 
of heavy polypeptide chains and which antibody is further 
i devoid of light polypeptide chains, which process com- 
prises: 

immortalizing lymphocytes, obtained for example from 
the peripheral blood of Camelids previously immu- 
nized with a determined antigen, with an immortal cell 
o and preferably with myeloma cells, in order to form a 



recovering the amplified DNA 

The clones can be expressed in several types of expres- 
sion vectors. As an example using a commercially available 1 
vector Immuno PBS (Huse et al: Science (1989) 246, 1275), 
clones produced in Bluescript® according to the above 
described procedure, are recovered by PCR using the same 
Xhol containing 5' primer and a new 3' primer, correspond- 
ing to residues 113-103 in the framework of the 
immunoglobulins, in which an Spe site has been con- 
structed: TC TTA ACT ACT GAG GAG ACG GTG ACC 
TG SEQ ID NO:51. This procedure allows the cloning of the 
V m in the Xho/Spe site of the Immuno PBS vector. 
However, the 3' end of the gene is not in phase with the 2 
identification "tag" and the stop codon of the vector. To 
achieve this, the construct is cut with Spe and the 4 base 
overhangs are filled in, using the Klenow fragment after 
which the vector is religated. A further refinement consists 
in replacing the marker ('tag") with a poly histidine so that 3 
metal purification of the cloned V„„ can be performed. To 
achieve this a Spe/EcoRI double stranded oligonucleotide 
coding for 6 bistidines and a termination codon is first 
constructed by synthesis of both strands followed by heating 



culturing the immortalized cells (hybridoma) formed and 
recovering the cells producing the antibodies having 
the desired specificity. 
; The preparation of antibodies can also be performed 
without a previous immunization of Camelids. 

According to another process for the preparation of 
antibodies, the recourse to the technique of the hybridoma 
cell is not required, 
i According to such process, antibodies are prepared in 
vitro and they can be obtained by a process comprising the 
steps of: 

cloning into vectors, especially into phages and more 
particularly filamentous bacteriophages, DNA or 
> cDNA sequences obtained from lymphocytes espe- 
cially PBLs of Camelids previously immunized with 
determined antigens, 
transforming prokaryonc cells with the above vectors in 
conditions allowing the production of the antibodies, 
' selecting the antibodies for their heavy-chain structure 
and further by subjecting them to antigen-affinity 



recovering the antibodies having the desired specificity, 
In another embodiment of the invention the cloning is 
performed in vectors, especially into plasmids coding for 



CTACTG CAC CAC CAT CAC CAT CAC TAA* TAG* SEQ ID N0.52 

AC GTG GTG GTA GTG OTA GTG ATT ATCTTAA SEQ ID NO:53 



The vector containing the insert is then digested with Spel 
and EcoRI to remove the resident "tag" sequence which can 
be replaced by the poly-His/termination sequence. The pro- 
duced V HH can equally be detected by using antibodies 
raised against the dromedary Vg, regions. Under laboratory 
conditions, V ra regions are produced in the Immuno PBS 
vector in mg amounts per liter. 

The invention also relates to a DNA library composed of 
nucleotide sequences coding for a heavy-chain 
immunoglobulin, such as obtained from cells with rear- 
ranged immunoglobulin genes. 

In a preferred embodiment of the invention, the library is 
prepared from cells from an animal previously immunized 
against a determined antigen. This allows the selection of 
antibodies having a preselected specificity for the antigen 
used for immunization. 

of the cDNA is not performed prior to the cloning of the 
cDNA. 

The heavy-chain of the four-chain immunoglobulins 
remains sequestered in the cell by a chaperon protein (BIP) 
until it has combined with a light chain. The binding site for 
the chaperon protein is the C w l domain. As this domain is 
absent from the heavy chain immunoglobulins, their secre- 
tion is independent of the presence of the BIP protein or of 
the light chain. Moreover the inventors have shown that the 
obtained immunoglobulins are not sticky and accordingly 
will not abnormally aggregate. 



bacterial membrane proteins. Procaryotic cells are then 
transformed with the above vectors in conditions allowing 
the expression of antibodies in their membrane. 
45 The positive cells are further selected by antigen affinity 
selection. 

The heavy chain antibodies which do not contain the C H 1 
domain present a distinct advantage in this respect Indeed, 
the C„l domain binds to BIP type chaperone proteins 
present within eukaryotic vectors and the heavy chains are 
not transported out of the endocytoplasmic reticulum unless 
light chains are present This means that in eukaryotic cells, 
efficient cloning of 4-chain immunoglobulins in non mam- 
malian cells such as yeast cells can depend on the properties 
of the resident BIP type chaperone and can hence be very 

55 difficult to achieve. In this respect the heavy chain antibodies 
of the invention which lack the C H 1 domain present a 
distinctive advantage. 

In a preferred embodiment of the invention the cloning 
can be performed in yeast either for the production of 

60 antibodies or for the modification of the metabolism of the 
yeast As example, Yep 52 vector can be used. This vector 
has the origin of replication (ORI) 2u of the yeast together 
with a selection marker Leu 2. 
The cloned gene is under the control of gall promoter and 

65 accordingly is inducible by galactose. Moreover, the expres- 
sion can be repressed by glucose which allows the obtention 
of very high concentration of cells before the induction. 
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The cloning between BamHI and Sail sites using the same The invention also provides hetero- specific antibodies 

strategy of production of genes by PCR as the one described which can be used for the targetting of drugs or any 

above, allows the cloning of camelid immunoglobulin genes biological substance like hormones. In particular they can be 

in E. coli. As example of metabolic modulation which can use< i to selectively target hormones or cytokines to a limited 

be obtained by antibodies and proposed for (he yeast, one 5 category of cells. Examples are a combination of a murine 

can site the cloning of antibodies directed against cyclins, or human antibody interleuMn 2 (JL^ and a 

that is proteins involved _ in to j regulation of the cellular heavy -chain antibody raised against CD 4 cells. This could be 

cycle of the yeast fTlBS 16 430 J. D. Mc Kinney N. Heintz used to £ D ^ which ^ ^ ^ 

1991). Another example is the introduction by genetic receD t 0 r 

the genome of the yeast The CD M is involved at the level be used for the preparation of hetero-speofic antibodies, 

of the initiation of cell division, and therefore the expression These can be achieved either according to the above 

of antibodies against this molecule would allow an efficient described method by reduction of the badges between the 

control of multiplication of the cells and the optimization of different chains and reoxydation, according to the usual 

methods for the production in bioreactors or by means of 15 techniques, of two antibodies having different specificities, 

immobilized cells. but it can also be achieved by serial cloning of two anti- 

In yet another embodiment of the invention, the cloning bodies for instance in the Immuno pBS vector, 

vector is a plasmid or a eukaryotic virus vector and the cells In such a case, a first gene corresponding to the V HH 

to be transformed are eukaryotic cells, especially yeast cells, domain comprised between Xho site and a Spe site is 

mammalian cells for example CHO cells or simian cells 20 prepared as described above. A second gene is then prepared 

such as Veto cells, insect cells, plant cells, or protozoan through an analogous way by using as 5' extremity a primer 

cells. containing a Spe site, and as 3' extremity a primer containing 

For more details concerning the procedure to be applied a termination codon and an EcoRI site. The vector is then 

in such a case, reference is made to the publication of Marks digested with EcoRI and Xhol and further both V„, genes 

et al, J. Mol. Biol. 1991, 222:581-597. 25 are digested respectively by Xho/e and by Spe/EcoRL 

Furthermore, starting from the immunoglobulins of the After ligation, both immunoglobulin genes are serially 

invention, or from fragments thereof, new immunoglobulins cloned. The spacing between both genes can be increased by 

or derivatives can be prepared. the introduction of addition codons within the 5' Spel primer. 

Accordingly immunoglobulins replying to the above In a particular embodiment of the invention, the hinge 

given definitions can be prepared against determined and- 30 region of IgG2 immunoglobulins according to the invention 

gens. Especially the invention provides monoclonal or poly- is semi-rigid and is thus appropriate for coupling proteins. In 

clonal antibodies devoid of light polypeptide chains or such an application proteins or peptides can be linked to 

antisera containing such antibodies and directed against various substances, especially to ligands through the hinge 

determined antigens and for example against antigens of region used as spacer. Advantageously the fragment com- 

pathological agents such as bacteria, viruses or parasites. As 35 prises at least 6 amino acids. 

example of antigens or antigenic determinants against which According to the invention it is interesting to use a 

antibodies could be prepared, one can cite the envelope sequence comprising a repeated sequence Fro-X, X being 

glycoproteins of viruses or peptides thereof, such as the any amino-acid and preferably Gin, Lys or Glu, especially a 

external envelope glycoprotein of a HTV virus, the surface fragment composed of at least a 3-fold repeat and preferably 

antigen of the hepatitis B virus. 40 of a 12-fold repeat, for coupling proteins to ligand, or for 

Immunoglobulins of the invention can also be directed assembling different protein <k 

against a protein, hapten, carbohydrate or nucleic acid. The hinge region or a ft 

Particular antibodies according to the invention are for coupling proteins to ligands or for assembling different 

directed against the galactosyla- 1-3-galactose epitope. protein domains. 

The immunoglobulins of the invention allow further the 45 Usual techniques for the coupling are appr op riate and 

preparation of combined products such as the combination especially reference may be made to the technique of protein 

of the heavy-chain immunoglobulin or a fragment thereof engineering by assembling cloned sequences, 

with a toxin, an enzyme, a drug, a hormone. The antibodies according to this invention could be used 

As example one can prepare the combination of a heavy- as reagents for the diagnosis in vitro or by imaging tech- 
chain immunoglobulin bearing an antigen binding site rec- 50 niques. The immunoglobulins of the invention could be 
ognizing a myeloma immunoglobulin epitope with the abrin labelled with radio-isotopes, chemical or enzymatic markers 
or mistletoe lectin toxin. Such a construct would have its or chemiluminescent markers. 

uses in patient specific therapy. As example and especially in the case of detection or 

Another advantageous combination is that one can pre- observation with the immunoglobulins by imaging 

pare between a heavy-chain immunoglobulins recognizing 55 techniques, a label like technetium, especially technitium 99 

an insect gut antigen with a toxin specific for insects such as is advantageous. This label can be used for direct labelling 

the toxins of the different serotypes of Bacillus thuringiensis by a coupling procedure with the immunoglobulins or 

or Bacillus sphaericus. Such a construct cloned into plants fragments thereof or for indirect labelling after a step of 

can be used to increase the specificity or the host range of preparation of a complex with the technitium. 

existing bacterial toxins. 60 Other interesting radioactive labels are for instance 

The invention also proposes antibodies having different indium and especially indium 111, or iodine, especially I 131 , 

specificities on each heavy polypeptide chains. These I 12 * and I 123 . 

multifunctional, especially bifunctional antibodies could be For the description of these techniques reference is made 

prepared by combining two heavy chains of immunoglobu- to the FR patent application published under number 

lins of the invention or one heavy chain of an immunoglo- 65 2649488. 

bulin of the invention with a fragment of a four-chain model In these applications the small size of the W lm fragment 

advantage for penetration into tissue. 
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The invention also concerns monoclonal antibodies react- The cells are recovered after two days and the presence of 

ing with anti-idiotypes of the above-described antibodies. V ffl regions is determined with an ELBA assay by using 

The invention also concerns cells or organisms in which rabbit antiserum reacting with the V HH part 
heavy-chain immunoglobulins have been cloned. Such cells The invention further allows the preparation of catalytic 
or organisms can be used for the purpose of producing 5 antibodies through different ways. The production of anti- 
heavy-chain immunoglobulins having a desired preselected directed against components mimicking activated 
specificity, or corresponding to a particular repertoire. They sta tes of substrates (as example vanadate as component 
can also be produced for the purpose of modifying the mimicking me activated state of phosphate in order to 
metabolism of the cell which expresses them. In the case of ^ mdr phosphoesterase activities, phosphonate as 
modificattonof the membohsm of cells transformed wife the und me bind ing in order to pro- 
sequences coding for heavy-chain ^immunoglobulins, these du( ^ oteases) t0 Xin antibodies having a cata- 
produced heavy-chain immunoglobulins are used like anti- , £ ~7 ■ iT ZuZl- 
sense DNA. Antisense DNA is usually involved in blocking ^ toctI ° n - Aaoths[ wa y t0 obtam ^ch antibodies con- 
the expression of certain genes such as for instance the Sls * ™ Planning a random mutagenesis in clones of 
variable surface antigen of trypanosomes or other patho- antibodies for example by PCR, in introducing abnormal 
gens. Likewise, the production or the activity of certain 15 bases durm 6 *e amplification of clones. These amplified 
proteins or enzymes could be inhibited by expressing anti- fragments obtained by PCR are then introduced within an 
bodies against mis protein or enzyme within the same cell. appropriate vector for cloning. Their expression at the 
The invention also relates to a modified 4-chain immu- surface of the bacteria permits the detection by the substrate 
noglobulin or fragments thereof, the V„ regions of which of clones having the enzymatic activity. These two 
has been partialy replaced by specific sequences or amino 20 approaches can of course be combined. Finally, on the basis 
acids of heavy chain immunoglobulins, especially by of the data available on the structure, for example the data 
sequences of the Y HH domain. A particular modified V H obtained by XRay crystallography or NMR, the modifica- 
domain of a four-chain immunoglobulin, is characterized in tions can be directed. These modifications can be performed 
that the leucine, proline or glutamine in position 45 of the by usual techniques of genetic engineering or by complete 
V H regions has been replaced by other amino acids and ^ synthesis. One advantage of the V^, of the heavy chain 
preferably by arginine, glutamic acid or cysteine. immunoglobulins of the invention is the fact that they are 

A further modified V„ or W L domain of a four-chain sufficiently soluble, 

immunoglobulins charactered by linMng of CDR loops ^ h cnain immunoglobulins of the invention can 



together or to FW regions by the introduction of paired 
cysteines, the CDR region being selected between the CDR, 
and the CDR 3 , the FW region being the FW 2 region, and 
especially in which one of the cysteines introduced is in 



further be produced in plant cells, especially in transgenics 
) plants. As example the heavy chain immunoglobulins can be 
produced in plants using the pMon530 plasmid (Roger et aL 



position 31, 33 of Ihe CDR, or 45 of FW, and the other in Meth Enzvm 153 1566 1987) constitutive plant expression 
' vector as has been described for classical four chain anti- 



CDR 3 . 



Esp<xiaUy me mtrowction of paired cysteines is such that bodies (Hiatetal. Nature 342 76-7g, 1989) once again using 
the CDRj loop is linked to the FW2 or CDR1 domain and 35 me appropriate PCR primers as described above, to generate 
mOTe especially the cysteine of the CDR3 of the V„ is linked a DNA fragment in the right phase, 
to a cysteine in position 31, 33 of the CDRj or in position Other advantages and characteristics of the invention will 
45 of FW 2 . become apparent in the examples and figures which follow. 

In another embodiment of the invention, plant cells can be FIGURES 
modified by the heavy-chain immunoglobiUins according to 40 

the invention, in order that they acquire new properties or FIG. 1(A), (B), and (C): Characterisation and purification 
increased properties. of camel IgG by affinity chromatography on Protein A and 

The heavy-chain immunoglobulins of the invention can Protein G sepharose (Pharmacia) 
be used for gene therapy of cancer for instance by using Figure (A) shows, after reduction, Ihe SDS-PAGE protein 
antibodies directed against proteins present on the tumor 45 profile of the adsorbed and non adsorbed fractions of Corn- 
cells, eius dromedarius serum. The fraction adsorbed on Protein A 

In such a case, the expression of one or two genes and eluted with NaCl 0.15M acetic acid 0.58% show upon 
can be obtained by using vectors derived from parvo or reduction (lane c) three heavy chain components of respec- 
adeno viruses. The parvo viruses are characterized by the tively 50, 46 and 43 Kd and light chain (rabbit IgG in lane 
fact that they are devoid of pathogenicity or almost not so a). The fractions adsorbed on a Protein G Sepharose 
pathogenic for normal human cells and by the fact that they (Pharmacia) derivative which has been engineered to delete 
are capable of easily multiplying in cancer cells (Russel S. the albumin binding region (lane e) and eluted with 0. 1M gly 
J. 1990, Immunol. Today IL 196-200). HQ pH 2.7 lacks the 4G Kd heavy chain which is recovered 

The heavy-chain immunoglobulins are for instance in the non adsorbed fraction (lane f). None of these com- 
cloned within Hindm/Xbal sites of the infectious plasmid of 55 ponents are present in the fraction non adsorbed on Protein 
Ihe murine MVM virus (pMM984). (Merchlinsky et al, A (lane d), lane b contains the molecular weight markers. 
1983, J. ViroL 47, 227-232) and then placed under the Figures (B) and (C) By differential elution, immunoglo- 
control of the MVM38 promoter. bulin fractions containing the 50 and 43 Kd heavy chain can 

The gene of the domain is amplified by PCR by be separated. 5 ml of C. dromadarius serum is adsorbed onto 
using a 5' primer containing an initiation codon and a 60 a 5 ml Protein G sepharose column and the column is 
HindHI site, the 3' primer containing a termination codon extensively washed with 20 mM phosphate buffer, pH 7.0. 
and a Xbal site. Upon elution with pH 3.5 buffer (0.15M Nad, 0.58% acetic 

This construct is then inserted between positions 2650 acid) a 100 Kd component is eluted which upon reduction 
(Hindm) and 4067 (Xbal) within the plasmid. yields a 43 Kd heavy chain, (lane 1). After column eluant 

The efficiency of the cloning can be checked by transfec- 65 absorbance has fallen to background level a second immu- 
tion. The vector containing die antibody is then introduced noglobulin component of 170 Kd can be eluted with pH 2.7 
in permissive cells (NB-E) by transfection. buffer (0.1M glycine HC). This fraction upon reduction 
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yields a 50 Kd heavy chain and a board light chain band FIG. 4(A) and (B): Purified IgG3 of the camel, by affinity 

(lane 2). The fraction non adsorbed on Protein G is then chromatography on Protein A Sepharose are partially 

brought on a 5 ml Protein A Sepharose column. After digested with papain and separated on Protein A sepharose. 

washing and elution with pH 3.5 buffer (0.15MN ? a, 0.58% 14 of IgG3 were m q.im phosphate 

,/T T,T,t r V L 1 , L, ' , by 1 hour incubation at 37° C. with mercurypapain (1% 

FIG. 2(A) and (B): Immunoglobulins of Camelus c to ^ ^ activated b me ^ 

10 ul of serum obteined from the different species were 10 £ endorf _ CCntnfu S e f ^ 5nUn ^^T^f P f 

addedtoEppendorr-tubescontaining lOmg of ftotein Acr S^^^ 1 ^^ t^^^^Tt 

Protein G Sepharose suspended in 400 ulof pH 8.3 immu- ^.'"t"* (B) " Bd , n ?° ^1 ^ 

noprecipitaton buffer (Nad 0.2. M, Tris 001M; EDTA bindung^cdon was ehttod from the column with 0.1M 

0.01M, Triton X100 1%, ovalbumin 0.1%). The tubes were 15 glycine HU DUtter pH U 

slowly rotated for 2 hours at 4° C After centrifugatJon me nG S: Schematic presentation of a model for IgG3 

pellets were washed 3 times in buffer and once in buffer in molecules devoid of light chains. 

which the Triton and ovalbumin had been ommitted. The FIG. 6: Schematic representation of immunoglobulins 

pellets were then resuspended in the SDS-PAGE sample having heavy polypeptide chains and devoid of light chains, 

solution 70 ul per pellet with or without dithiotreitol as ^ regarding conventional four-chain model immunoglobulin, 

reductant. After boiling far 3 min at 100° C, the tubes were Representation of a hinge region. 

centrifuged and the supernatants analysed. In all species FIG. 7: Alignement of 17 V ffl DNA sequences of Camel 

examined the unreduced fractions Figure (A) contain in heavy chain immunoglobulins SEQ ID NOS:92-108. 

addition to molecules of approximately 170 Kd also smaller mG g. Expression and purification of the camel V^l 

major components of approximately 100 Kd. In the reduced ^ from E coU 
sample Figure (B) the constituant heavy and light chains are 

detected. In all species a heavy chain component (marked by I HEAVY CHAIN ANTIBODIES IN CAMEUDS 
an asterisk *) is present in the material eluted from the 

Protein A but absent in the material eluted from the Protein „ ™ en Camehi 5 dromedarius serum is adsorbed on Protein 

q G sepharose, an appreciable amount (25-35%) of lmmuno- 

mn i/*vnvn. „„j „, m „„„a 30 globulins (Ig) remains in solution which can then be recov- 

FIG. 3(AXBXQ: IgG* and LjG 3 were prepared £ ^ chromatography on Protein A sepharose 

from serum obtained from healthy or Trypanosoma evans, "\ ^^TZiZ^La ™ p^j^ r> ™„ iU 

infected Camelus dromedarius (CATT titer 1/160 (3) and ^ ^ ***** °" ^ * 

analysed by mdioirnnmnotm 5 cirLtion or Western Blotting differentially elated into a tightly bound fraction (25%) 

„ . ■ ■. consisting of molecules of an unreduced apparent molecular 

for anti trypanosome activity 35 ^"J^ of 170 Kd and a more weakly bound fraction 

Figure (A) S metiwomne labelled Trypanosome evansi (3( ^ 5%) having m appMCBl molecul!a ^ of 100 K d 

antigens lysate (500.000 counts) was added to Eppendorf IB). Thel70 Kdcomponent when reduced yields 50 

tobes containing ; 10 g of s^^ ^ of^} ^ ^ Kd heavy chains and large 30 Kd tight chains. The 100 Kd 

^ irnmimopreapitetion buffer con- ^ toi]dfy of light chains and appears to be 

taming 0.1M TLCK as proteinase inhibitor and slowly 40 so i e i y composed of heavy chainTwhich after redaction have 

rotated at 4° C . during onehour. The tubes were then 0 „ ^JZmW of 43 Kd (FIG. 1C). The fraction which 

supplemented with 10 mg of Protein A Sepharose suspended doesnot bind to Protein G can be affinity purified and eluted 

in 200 ul of the same pH 83 buffer and incubated at 4= C from a Protein A column as a seSdioO Kd component 

foran additional hour. After washing and centafogation « whidl ^ lMaB s to ^ sol ~£l f 45 

15000 rpm for 12 s, each pellet was resuspended in 75 ul 45 hcavy chams 1 

SDS-PAGE sample solution containing DTT and heated for _ . 

3 min. at 100° C. After centrifugation in an Eppendorf ^ heavv cham nnmoglobuhns devoid of light chams 
minifuge at 15000 rpm for 30 s, 5 ulof the supernatant was total up to 75% of the molecules binding to Protein A 
saved for radioactivity detennination and the reminder As all three iinmunoglobulins bind to Protein A we refer 
analysed by SDS-PAGE and fluorography. The counts/5 ul 50 to 6x111 88 ^ namely IgGi (fight chain and heavy chain yl 
sample are inscribed on for each line. (50 Kd) binding to Protein G, IgG 2 (heavy chain y2 (46 Kd) 
Figure (B) (C) 20 ugoflgG., IgG, and IgG, from healthy 11011 bindm g to Plotdn G *8°3 &e*<ry chain y3 (43 Kd) 
and trypanosome infected animals were separated by SDS- bindin 8 to G ™ a * is a possibility that these three 
PAGE without prior reduction or heating. The separated subclasses) can be further subdivided 
samples were then electro transferred to a nitrocellulose 55 A comparative study of old world camelids (Camelus 
membrane, one part of the membrane was stained with bactrianus and Camelus dromedarius) and new world cam- 
Ponceau Red to localise the protein material and the elids (lama pacos, lama glama, lama vicuqna) showed that 
reminder incubated with 1% ovalbumin in TST buffer (Tris heavy chain immunoglobulins are found in all species 
10 mM, NaQ 150 mM, Tween 0.05%) to block protein examined, albeit with minor differences in apparent molecu- 
binding sites. After blocking, the membrane was extensively 60 1» weight and proportion. The new world camelids differs 
washed with TST buffer and incubated for 2 hours with from the old world camelids in having a larger LjG 3 mol- 
33 S-labelled trypanosome antigen. After extensive washing, ecule (heavy chain immunoglobulin binding to Protein G) in 
the membrane was dried and analysed by autoradiography. which the constituant heavy chains have an apparent 
To avoid background and unspecific binding, the labelled molecular weight of 47 Kd (FIG. 2A and B). 
trypanosome lysate was filtered through a 45u mnfipore 65 The abundance of the heavy chain immunoglobulins in 
filter and incubated with healthy camel immunoglobulin and the serum of camelids raises the question of what their role 

is in the immune response and in particular whether they 
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bear antigen binding specificity and if so how extensive is immunoglobulin heavy chain binding protein or BIP (11), 

the repertoire. This question could be answered by exam- which normally is replaced by the light chain (12). It is 

ining the immunoglobulins from Trypanosoma evansi possible that the primordial role of the light chain in the 

infected camels (Camehis dromedarius). four-chain model immunoglobulins is that of a committed 
For mis purpose, the corresponding fractions of IgG^ 5 heavy chain chaperon and that the emergence of light chain 

IgG 2 , IgG 3 were prepared from the serum of a healthy camel repertoires has just been an evolutionary bonus, 

and from the serum of camels with a high antitiypanosome The camelid yl and y3 chains are considerably shorter 

titer, measured by the Card Agglutination Test (3). In radio- than the normal mammalian y chain. This would suggest that 

immunoprecipitation, IgG!, IgG 2 and IgG 3 derived from deletions have occurred in the C H 1 domain. Differences in 
infected camel indicating extensive repertoire heterogeneity 10 sizes of the y2 and y3 immunoglobulins of old and new 

and complexity (FIG. 3A) were shown to bind a large world camelids suggests that deletions occurred in several 

number of antigens present in a 3S S methionine labelled evolutionary steps especially in the C H l domain, 

trypanosome lysate. H THE HEAVY CHAIN IMMUNOGLOBULINS 

^P\S^J£ZZ££SZZ° of the camelids lack the c h i domain. 

IgG 3 obtained from infected animals (FIG. 3B and C). The strategy followed for investigating the heavy chain 

This leads us to conclude that the camelid heavy chain immunoglobulin primary structure is a combination of pro- 

IgG 2 and IgG 3 are bona fide antigen binding antibodies. **** and cDNA sequencing; the protein sequencing is nec- 
An immunological paradigm states that an extensive M f 8881 ? to identify sequence streches characteristic of each 

antibody repertoire is general by the combination of the 1?™^°^; ™* N-termmal of the nnmunoglobukn 

tight and hiavy chain variable V region repertoires (6). The ^mg derived from the heavy chain variable region reper- 

hSvy chain iirUnoglobulins of mtcamelseem to contra- olre ^ V^i ^T* 10 * °? * 6 V ™ 

(variable region of the heavy chain) and cannot be used for 
class or subclass identification. This means that sequence 



diet this paradigm. 



Irrmiunoglotahns are characterized by a complex LE.F. 25 ^to had to be obtained from internal enzymatic or chemical 
(isoelectric focussing) pattern reflecting their extreme het- hmv»o« sites 



erogeneity. To determine whether the two heavy chains , . . _ . . ~ 

constituting the IgG, and IgG, are identical or not, the * combma *° n digestion and Proton A affinity 

isoeTectncLussSaJHF.Sern were observed before chromatography allowed the separation of various frag- 



isoelectric focussing (LE.F.) pattern were observed before 
and after chain separation by reduction and alkylation using 



ments yielding information on the general structure of IgG3. 



iodoacetamide as alkylating agent ^ *g G3 of camel (Camelus dromedarius) purified by 



As this alkylating agent does not introduce additional 
charges in the molecule, the monomers resulting from the 



affinity chromatography on Protein A Sepharose were par- 
tially digested with papain and the digest was separated on 
Protein A Sepharose into binding and non binding fractions. 



reduction and alkylation of a heavy chain homodirner will " /* J ^* UUU3C !"™7* Zr^ ^rT^ T 
f. "r ,, ./ , , " „ These fractions were analysed by SDS PAGE under reduc- 

have practically the same isolectnc point as the dimer, 35 - - ' -. 

whereas if they are derived from a heavy chain heterodimer, 
the monomers will in most cases differ sufficiently in iso- 



ing and non reducing conditions (FIG. 4A and B). 
The bound fraction contained two components, one of 28 

decttic"p^mtVg^e^Tdiff^rent patera mljEjT Md J OM rf " """H? to UDd, ?? Bd "I*^ 

cleaved material. They were well separated by gel electro- 



Upon reduction, and alkylation by iodoacetamide the 



J ,„ . , _ , , w phoresis (from preparative 19% SDS-PAGE gels) under non 



edarius IgG 2 and IgG 3 indicating that these molecules 



„. M , . m .. ... m . «^n (in 50 nM amonium bicarbonate, 0.1% (w/v) SDS using 

each composed of two identical heavy chains which migrate a BioRad e lectro-eluter). After lyophilization of these elec- 
ta the same position as the unreduced molecule they ongi- - ~ 
nated from. 



troeluted fractions, the remaining SDS was eliminated by 

45 precipitating the protein by the addition of 90% ethanol, 

In contrast, the LE.F. pattern of IgG x is completely rnixmgainlmcubalmgffieimxture overnight at -20° C. (14). 

modified after reduction as the isoelectric point of each The precipitated protein was collected in a pellet by centri- 
molecukisdeterrninedbyffieccml>uiationofmeisodectri (15000 rpm, 5 min) and was used for protein 

points of the tight and heavy chains which after separation sequencing. N-terminal sequencing was performed using the 

will each migrate to a different position. x automated Edman chemistry of an Applied Biosystem 477A 

These findings indicate that the heavy chains alone can pulsed liquid protein sequencer. Amino acids were identified 

generate an extensive repertoire and question the contribu- as their phenyltniohydantoin (PTH) derivatives using an 

tion ofme tight chain to the usefiUantilx)dyreDmoffe. If this Applied Biosystem 120 PTH analyser. All chemical and 

necessity be negated, what other role does the light chain reagents were purchased from Applied Biosystems. Analysis 

play. 55 of the chromatographic data was performed using Applied 

Normally, isolated heavy chain from mammalian immu- Biosystems software version 1.61. In every case the com- 

noglobulins tend to aggregate considerably but are only puter aided sequence analysis was cofirmed by direct inspec- 

solubOized by light chains (8, 9) which bind to the C w l tion of the chromatograms from the PTH analyser. Samples 

domain of the heavy chain. for protein sequencing were dissolved in either 50% (v/v) 

In humans and in mice a number of spontaneous or 60 trifluoroacetic addtTFA) (28Kd fragment) or 100% TEA. 

induced myelomas produce a pathological immunoglobulin (14Kd fragment). Samples of dissolved protein equivalent to 

solely composed of heavy chains (heavy chain disease). 2000 pmol (28 Kd fragment) or 500 pmol (14Kd fragment) 

These myeloma protein heavy chains carry deletions in the were applied toTFA-treated glass fibre discs. The glass fibre 

C H 1 and \ lm domains (10). The reason why full lenght discs were coated with BioBrene (3 mg) and precycled once 

heavy chains do not give rise to secreted heavy chain in such 65 before use. 

pathological immunoglobulins seems to stem from the fact N-terminal sequencing of the 28 Kd fragment yields a 

that the synthesis of Ig involves a chaperoning protein, the sequence homologous to the N-terminal part of y C«2 
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domain and hence to the N-terminal end of the Fc fragment the amplified cDNA was then digested with EcoRI and Kpnl 

The N-terminal sequence of the 14.4 Kd fragment corre- for series #56 clones and with Xhol and Kpnl for series #72 

sponds to the last lysine of a y C H 2 and the N-terminal end clones. A final phenol-chloroform extraction preceded the 

of a y Ch3 domain (Table 1). The molecular weight (MW) ligation into pUC 1 8( series #56 clones) or into pBluescript* 

of the papain fragments and the identification of their 5 (series #72 clones). 

N-terminal sequences led us to conclude that the C H 2 and All the clones obtained were smaller that the 860 base 

C«3 domains of the y3 heavy chains are normal in size and pairs to ^ expected if they possessed a complet V„„ and 

mat the deletion must occur either in the C H \ or in the q h I region. Partial sequence data corresponding to the 

domain to generate the shorted y3 chain. The fractions which N-terminal of the region reveals that out of 20 clones, 

donotbmdtoErotemASepharosecontaintwobandsof34 10 3 were identical and possibly not independent. The 

and 17 Kd which are more diffuse is SDS PAGE indicating sequences obtained ressemble the human subgroup m and 

mat they originate from the variable N-terminal part of the the murine subgroups ma and mb (Table 2) 

molecule (FIG. 4A and B). aones CQfresponding |0 ^ maent sets of Q1 ^ n 

Upon reduction, a single diffuse band of 17 Kd is found sequences were obtained. A first set of sequences (#72/41) 

indicating that the 34 Kd is a disulfide bonded dimer of the " had a N4emiilia i K g m to me one obtained by 

17 Kd component The 34 Kd fragment apparently contains protein sequencing of the 28 Kd papain fragments of the y3 

the hinge and the N-terminal domain V m . The protein heavy chain, a short hinge region containing 3 cysteines and 

sequence data can be used to construct degenerate oligo- a variable regioil corresponding to the framework (FR4) 

nucleotide primers allowing PGR amplification of cDNA or residues encoded by the J mimgenes adjoining the hinge, 

genomic DNA. 20 ^ ^ domain fa entjrely kddng ^ cDNA amipoias 

It has been shown that the cells from camel spleen imprint to the y3 chain (Table 4). 

cells reacted with rabbit and anti camel mmunoglobulin in one closely related sequence (#72/1) the proline in 

sera and that the spleen was hence a site of synthesis of at position 259 is replaced by threonine. 



least one immunoglobulin class. cDNA was therefore syn- 



thetised from camel spleen mRNA The conditions for the 25 ^S^^S^S^^^S^ 
isolation ofRNA were me following total RN A was isolated 255 me C ^ W f °! ^ C °NA using as 



isolation of RNA were the following: total RNAwas isolated 



from the dromedary spleen by me guanidium isothiocyanate gJiSSS L?5£5. £5£ TEZSSZTJZ 

method (15). mRNA was purified with oligo T-paramagnetic * mSa ? d . at me 5 , en ^ Thc to ^ ? clm ° 

template, an oligodT primer and reverse transcriptase n r_.. 

(BOERfflNGER MAN). Second strand cDNA is obtained P _ 

using RNAse H and E coti DNA polymerase I according to ^ sec ! uence of this y3 chain presents similarities with 

the condition given by the supplier. Relevant sequences other 7 chains except mat it lacks the C„l domain, the V„„ 

were amplified by PGR: 5 ng of cDNA was amplified by domaln ^"8 ^J^t to 

PCR in a 100 ul reaction mixture (10 mM Tris-HQ pH 8.3, 35 One or all three of the cysteines could be probably 

50 mM KO, 15 mM Mgd 2 , 0.01% (w/v) gelatine, 200 uM responsible for holding me two y3 chains together, 

of each dNTP and 25 pinoles of each primer) overlaid with These results have allowed us to define a model for the 

mineral oil (Sigma). Degenerate primers containing EcoRI IgG3 molecule based on sequence and papain cleavage 

and Kpnl sites and further cloned into pUC 18. After a round (FIG. 5). 

of denaturing and annealing (94° C. for 5 min and 54° C. for 40 Papain can cleave the molecule on each side of the binge 

5 min), 2 units of Taq DNA polymerase were added to the disulfides and also between C„2 and C*3. Under non 

reaction mixture before subjecting it to 35 cycles of ampli- reducing conditions the V m domains of IgG3 can be 

fication:l min at 94° C. (denature) 1 min at 54° C. (anneal), isolated as disulfide linked dimer or as monomer depending 

2 min at 72° C (elongate). To amplify DNA sequences on the site of papain cleavage. 

between V«, and C„2 domains , (#72 clones), the PCR was 45 A XCOB& set of doacs #72/29 had a slightly different 

^Z^^^Zl. wL ^^^T 011 fOT me C ^ ™ by a very long 

annealing temperature was increased to 60° C hinge immediately preceded by the variable domain. This 

One clone examined (#56/36) had a sequence correspond- hinge region has 3 cysteines at its C-terminal end in a 

ing to the N-terminal part of a C«2 domain identical to the x sequence homologeous to the y3 hinge. Such second set of 

sequence of the 28 Kd fragment The availability of this clones could represent the IgG2 subclass. For the constant 

sequence data allowed the construction of an exact 3' primer part of the y3 and also for the putative y2, most clones are 

and the cloning of the region between the N-tenninal end of identical showing the y2 or y3 specific sequences. A few 

the Vjh, and the C^l domain. clones such as #72/1 however show minor differences. For 

5' primers corresponding to the mouse V ffl (16) and 55 instance in the case of clones #72/1 two nucleotide differ- 

containing a Xhol restriction site were used in conjunction ences are detected. 

with the 3* primer in which a Kpnl site had been inserted and Several regions cDNA's have now been totally or 

the amplified sequences were cloned into pBluescript*. partially sequenced with the exception of a short stretch at 

Clone #56/36 which displayed two internal HaeHI sites was the N-terminal end which is primer derived 

digested with this enzyme to produce a probe to identify «, Upon translation the majority shows the characteristic 

PCR positive clones. heavy chain Ser 21 CyS 22 and Tyr^ Tyr 91 Cys^ sequences, 

After amplification the PCR products were checked on a of the intra V ffl region disulfide bridge linking residues 22 

1.2% (w/v) agarose gel. Cleaning up of the PCR products and 92. All these clones have a sequence corresponding to 

included a phenol-chloroform extractio followed by further the framework 4 (FR4) residues of the variable region 

purification by HPLC (GEN-PAC FAX column, Waters) and 65 immediately preceding the postulated binge sequence (Table 

finally by using the MERMAID or GENECLEAN H kit, 3). This sequence is generated by the J minigenes and is i 

BIO 101, Inc) as appropriate. After these purification steps, the majority of cases si " 
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human and murine J minigenes. The sequence length 
between region Cys 92 and the C-terminal end of the W Hf1 
regions is variable and, in the sequences determined, range 
from 25 to 37 amino-acids as one might expect from the 
rearrangements of J and D minigenes varying in length. ; 

Several important questions are raised by the sole exist- 
ence of these heavy chain immunoglobulins in a non patho- 
logical situation. First of all, are they bonafide antibodies? 
The heavy chain immunoglobulins obtained from trypano- 
some infected camels react with a large number of parasite 11 
antigens as shown in part I of these examples. This implies 
that the camelid immune system generates an extensive 
number of binding sites composed of single V^j, domains. 
This is confirmed by the diversity of the regions of the 
heavy chain immunogobulins obtained by PCR. 1 

The second question is "how are they secreted?'. The 
secretion of immunoglobulin heavy chains composing four- 
chain model immunoglobulins does not occur under normal 
conditions. A chaperoning protein, the heavy chain binding 
protein, or BIP protein, prevents heavy chains from being 2 
secreted. It is only when the light chain dispplaces the BIP 
protein in the endoplasmatic reticulum that secretion can 
occur (13). 

The heavy chain dimer found in the serum of human or 2 
mice with the so-called "heavy chain disease" lack the C„l 
domains thought to harbour the BIP site (14). In the absence 
of thi domain the BIP protein can no longer bind and prevent 
the transport of the heavy chains. 

The presence in camels of a IgGl class composed of 3 
heavy and light chains making up between 25% and 50% of 
the total IgG molecules also raises the problem as to how 
maturation and class switching occurs and what the role of 
the light chain is. The camelid light chain appears unusually 
large and heterogeneous when examined in SDS PAGE 3 

The largest dimension of an isolated domain is 40 A and 
the maximum attainable span between binding sites of a 
conventional IgG with C#l and Vjflu will be of the order of 
160 A (2VHH+2CW1) (19). The deletion of C H 1 domain in 
the two types of heavy chain antibodies devoid of light 4 
chains, already sequenced has, as a result, a modification of 
this maximum span (FIG. 6). In the IgG3 the extreme 
distance between the extremities of the regions will be 
of the order of 80 A (2V« W ). This could be a severe 
limitation for agglutinating or cross linking. In the IgG2 mis 4 
is compensated by the extremely long stretch of hinge, 
composed of a 12-fold repeat of the sequence Pro-X (where 
X is Gin, Lys or Glu) and located N-terminal to the hinge 
disulfide bridges. In contrast, in the human IgG3, the very 
long hinge which also apparently arose as the result of 5 
sequence duplication does not contribute to increase the 
distance spanning the two binding sites as this hinge is 
inter-spersed with disulfide bridges. 

The single V ffl domain could also probably allow con- 
siderably rotational freedom of the binding site versus the Fc 5 
domain. 

Unlike myeloma heavy chains which result probably from 
C H l deletion in a single antibody producing cell, or heavy 
chain antibodies produced by expression cloning(15); the 
camelid heavy chain antibodies (devoid of light chains) have 
emerged in a normal immunological environment and it is 
expected that they will have undergone the selective refine- 
" " y and affinity accompanying B " 



Expression and Purification of the Camel V HH 21 
(DR21 on FIG. 7) Protein from E. coli 

The clones can be expressed in several types of expres- 
sion vectors. As an example using a commercially available 
vector Immuno PBS (Huse et al: Science (1989) 246, 1275), 
clones produced in Bluescript® according to the above 
described procedure, have been recovered by PCR using the 
same Xhol containing 5' primer and a new 3' primer, 
corresponding to residues 113-103 in the framework of the 
immunoglobulins, in which an Spe site has been con- 
structed: TC TTA ACT AGT GAG GAG ACG GTG ACC 
TG SEQ ID NO:. This procedure allowed Ihe cloning of (he 
V HH in the Xho/Spe site of the Immuno PBS vector. 
However, the 3' end of the gene was not in phase with the 
identification "tag" and the stop codon of the vector. To 
achieve this, the construct was cut with Spe and the 4 base 
overhangs were filled in. using the Klenow fragment after 
which the vector was religated. 

The expression vector plasmid ipBS (immunopBS) 
(Stratacyte) contains a pel B leader sequence which is 
used for immunoglobulin chain expression in E. coli 
under the promoter pLAC control, a ribosome binding 
site, and stop codons. In addition, it contains a sequence 
for a c-terminal decapeptide tag. 
E. coli JM101 harboring the ipBS-V^l plasmid was 
grown in 1 1 of TB medium with 100 ug/ml ampicdlfin 
and 0. 1% glucose at 32° C. Expression was induced by 
the addition of 1 mM IPTG (final concentration) at an 
ODjjo of 1.0. After overnight induction at 28° G, the 
cells were harvested by centrifugation at 4.000 g for 10 
min (4° C.) and resuspended in 10 ml TES buffer (0.2M 
Tris-HCLpH 8.0, 0.5 mM EDTA, 0.5M sucrose). The 
suspension was kept on ice for 2 hours. Periplasmic 
proteins were removed by osmotic shock by addition of 
20 ml TES buffer diluted 1 :4 v/v with water, kept on ice 
for one hour and subsequently centrifugated at 12.000 
g for 30 min. at 4° C The supernatant periplasmic 
fraction was dialysed against Iris-HQ pH 8.8, NaCl 50 
mM, applied on a fast Q Sepharose flow (Pharmacia) 
column, washed with the above buffer prior and eluted 
with a linear gradient of 50 mM to 1M NaCl in buffer. 
Fractions containing the V ffl protein were further 
purified on a Superdex 75 column (Pharmacia) equili- 
brated with PBS buffer (0.01M phosphate pH 72, 
0. 15M Nad). The yield of purified V ffl protein varies 
from 2 to 5 mgfl cell culture. 
Fractions were analyzed by SDS-PAGE(I). Positive iden- 
tification of the camel V^, antibody fragment was 
done by Western Blot analysis using antibody raised in 
rabbits against purified camel IgGH 3 and an anti-rabbit 
IgG-alkaline phosphatase conjugate (B). 
As protein standards (Pharmacia) periplasmic proteins 
prepared from 1 ml of IPTG-induced JMlOl/ipBS V^l 
were used. FIGS. 8A and 8B shows: effractions from fast 
S Sepharose column chromatography (C:Eluted at 650 mM 
NaCl Diluted at 700 mM NaQ) Effractions from Super- 
dex 75 column chromatography. 

As can be seen, the major impurity is eliminated by 



impurities are eliminated by gel filtration. 
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TABLE 1 
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Camel 
Clone 
Clone 
Clone 



Camel y 2 28Kd 

Clone # 72/1 

Clone # 72/4 

Clone #72/29 



K D V L S 

K D V L S 

K D V L S 

K D V L S 

K D T L M 

K D T L M 

K D T L M 



S G R P — — 



S R T P — — 



SEQ ID NO:54 
SEQ ID NO:55 
SEQ ID NO:56 
SEQ ID NO:57 
SEQ ID NO:112 
SEQ ID NO:113 
SEQ ID NO:114 



Camel % 14Kd — 



KK3 Q T R E P Q 
KK5 Q P R E P Q 



S R D E L 
S R E E M 



Q ID NO:54 
Q ID NO:115 
;Q ID NO:116 
Q ID NO: 117 



L V A S G 
L V E S G 
Q L L S G 



G G S L 
G G S L 
G G S L 
G G S L 
GSVQ A GGSL 



G S V G 
G L V E 



GGSL 
GGSL 
GGSL 



LSCEI SGLTFD 
LSCAVSGFSFS 
LSCAI SGYTYG 
LSCTVSGATYS 
LSCTGSGPPYS 
LSCTASGDSFS 
LSCAT SGFTF 
LSCAASGFTF 



SEQ ID NO:l 
SEQ ID NO:2 
SEQ ID NO-.3 
SEQ ID NO:4 
SEQ ID NO:5 



TABLE 3 



Comparison of some Framework 4 residues found in the Camel V ffl 




WGQGTLVTVSS SEQ ID NO 9 

WGRGTLVTVSS SEQIDNO:130 

WGQGTTVTVSS SEQ ID NO:120 

WGQGTMVTVSS SEQ ID NO: 121 

WGQGTTL TVSS SEQIDNO:122 

WGQGTLVTVSS SEQ ID NO S 

WGQGTSVTVSS SEQ ID NO:123 

WGAGTTVTVSS SEQ ID NO:124 



Camel WGQGTQVTVSS SEQ ID NO* 
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TABLE 3-continued 




WGQGTQVTV 
WGQGTLVTV 
WGRGTQVTV 
WGQGTHVTV 
WGQGIQVTA 



S S SEQ ID NOS 1 Clone 

S S SEQIDNO:59 #72/24 

S S SEQ ID NO:60 #72/21 

S S SEQIDNO:61 #72/16 



TABLE 4 

SEQ ID NO: 125 

dyygs 8 y--f d v WG AGTTVTVS S MOUSE V 

95 100 a b c d e f g h i j k 101 105 110 

1 a 1 q p g g y c g y g x = = = = = = = = = = c 1 WGQGT QVT VS S SEQIDNO:13 

2 v s 1 md r i a q h = = = = = = = = = = = = g c RGQGT QVT V S L SEQH)NO:14 

3 vpahl gpgai 1 dl kky = = = = = = ky WG QGTQVTVS S SEQIDNO:15 

4 fcystagdggsge= = = = = = = = = my WGQGT Q V T V S S SEQ ID NO: 16 

7 elsggscelpllf=========dyWGQGTQVTVSS SEQ ID NO: 17 

9 d w k y w t c g a q t g g y f = = = = = = = g q WGQGAQVT VS S SEQ ID NO: 18 

11 rltemgacdarwatlatrtfaynyWGQGTQVTVSS SEQ ID NO: 19 '. 

13 q k k d r t r w a e p r e w= = = = = = = = n n WGQ GT QVT AS S SEQIDNO:20 

16 gsrfsspvgstsrles=sdy==nyWGQGIQVTASS SEQ ID NO: 21 

17 adpsiyysiliiey========kyWGQGTQVTVSS SEQIDNO:22 18 different camel 

18 d s pcympt mpippi r d s fgw= = ddFGQGTQVTVSS SEQIDNO:23 VHW region 

19 t s s t y wy c t t a p y = = = = = = = = = n v WGQ GT QVT V S S SEQIDNO:24 

20 tei ewygcnl rt t f= = = = = = = = t r WG QGTQVTVS S SEQ ID NO: 25 

21 nql aggwyl dpnywl svgay = = ai WG QGTHVTVS S SEQIDNO:26 

24 rl t emgacdt rwal 1 at r t f ayny WG RGTQVTVS S SEQIDNO:27 

25 dgwt i keggi gl pws vqcedgyny WG QGTQVTVS S SEQIDNO:28 
27 dsypchll==============dvWGQGTQVTVSS SEQ ID NO: 29 

29 veypi admc s-------- --r y GDPGTQVTVS S SEQ ID NO: 30 

V ^ S 

CDR3 

Human & mouse - size range 0-19 aa over 600 entries 
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10 20 . 40 

EVQLVESGG LVQPGGSLRL SCAASG SEQ ID NO:80 ' CDR1 !WVRQA PGKGLEWVS SEQEDNO:8l! CDR2 

GG S VQGGGS LRL S CAI S G SEQIDNO:84j CDR1 ! WFREG P GEE RE GI A SEQIDNO:85! CDR2 

GGSVQAGGSLRL SCASSS EEQlDNO:88; CDR1 ! WYRQA P GEE REF VS SEQTDNO:89; CDR2 



70 80 90 110 

RF TI S RDNS KNTLYL Q MNS LRAEDTAVY YCAR SEQIDNO:82! C DR 3 ! WGQGTL VT VS S SEQIDNO:83 

RF TI S QDS TLKTMYL L MNNL EPEDTGTY YC A A SEQIDNO:86| C D R 3 ! WGQGTQVT VS S SEQIDNO:91 

RF TI S QDS AKNTVYL Q MNS LKP E DT AMY YCKI SEQEONOSO; CDR3 ! WGQGTQVT VS S SEQIDNO:87 



WGQGTQVT VS S 
WGQGTQVT VS S 


GTNEVCKCPKCP 

— EPKI PQPQPKPQPQP 
QPQPKPQP 
KPEPECTCPKCP 


C«2 

APELP GG P S VF VF P SEQ ID NO:91 
APELLGGPSVFI FP SEQIDNO:87 


human ChI 
humangamma3KVDKRV 

human gamma IK VDKK 

human gamma 2 K VK VT V 
human gamma 4 KVDKR V 


eletplgdithtcprcp 

e'p'kcsdtpppcprcp 

e'Ip'ksc dtpppcprcp 
aep ks cdkthtcpp cp 

erkccvecppcp 

eskygppcpscp 


-est 

APELLGG PSVFLFP SEQE)NO:126 
APELLGG PSVFLFP SEQIDNO:127 
APPVAG — PSVFLFP SEQE>NO:128 
APEFLGG PSVFLFP SEQE>NO:129 
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( i )SBQUENCE< 




( i i ) MOLECULE TYPE: peptide 



5,800,9: 



) FEATURE: 

( B ) LOCATION: 1. 



) SEQUENCE DESCRIPTION: SEQ ID N< 



< 2 ) INFORMATION FOR SEQ ID 



(A 
( B 

( c 

( D ) TOPOLOGY: liaeat 

( i i ) MOLECULE TYPE: peptide 

( i i ) FEATURE: 

( A ) NAME/KEY: Dorat 
( B ) LOCATION: 1..22 
(D). 



( 2 ) INFORMATION FOR SEQ ID N03: 



( A ) LENGTH: 22 xmu 
( B ) TYPE: amino acid 
( C ) STRANDEDNESS: 
( D )TOPOLOOY:IiiiH« 

( i i ) MOLECULE TYPE; peptide 



( B ) LOCATION: 1-22 



Oly G 1 y Ser Glo Oil OIj 

1 5 

Ser Oly Tyr Thi Tyr Oly 
( 2 ) INFORMATION FOR SEQ ID NO*: 



( B ) LOCATION: 1. 



( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:4: 



5.800,988 



2 ) INFORMATION FOR SBQ ID NOr5: 

( i ) SEQUENCE CHARACTERISTICS: 



( D ) TOPOLOGY: En 
) MOLECULE TYPE: peptide 



( B ) LOCATION: 1_22 
) SEQUENCE DESCRIPTION: SEQ ID NO* 



Gl jr Olj Se 



( 2 ) INFORMATION FOR SBQ ID NOA 




( X i ) SEQUENCE DESCRIPTION: SBQ ID NO& 

Oljr Olj Ser V»l Oil Ala Gly Ol j Sir Leo Aig Le 
1 S 10 

Gly Pbo Gly Ik: Set 



) MOLECULE TYPE: peptide 



1 a Gly Olj Se 



( 2 ) INFORMATION FOR SBQ ID NO:8: 



5,800,988 

37 38 

-continued 



) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 11 amino acids 
( B ) TYPE: amino acid 
( C )S1 
( D ) Tf 



( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:8: 
Trp Gly Ola Gly Thr Gin Vil Thr Val Sir Se 



( 2 ) INFORMATION FOR SEQ ID N< 



( D ) TOPOLOGY: linear 
( i i ) MOLECULE TYPE: peptide 



( l i ) SEQUENCE DESCRIPTION: SEQ ID NOft 
Tip Gly Gin Gly Thr Lea V»l Thr V»l Ser Se 



( 2 ) INFORMATION FOR SBQ ID NO: 10: 

( i ) SEQUENCE CHARACTERISTICS: 



( D )TOPOLOGY:mi 

( i i )M 



( D ) OTHER INFORMATION: Aabel=FRAMEWORK 4 



: ) INFORMATION FOR SEQ ID 



( D ) TOPOLOGY: hi 



( D ) OTHER INFORMATION: rtabel=FRAMEWORK 4 
) SEQUENCE DESCRIPTION: SEQ ID NO=ll: 



5,800,988 
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( A ) NAME/KEY: Domain 
( B ) LOCATION: 1.11 

( D ) OTHER INFORMATION: /label=FRAMEWORK 4 
( x i ) SEQUENCE DESCRIPTION: SBQ ID NO:12: 
Aig Oly Gin Oly Thr Gla Vil Thi V»l Set Leu 



(C)STRANDEDNE! 
( D ) TOPOLOGY: B« 

( i i )M 

( i X ) FEATURJ 
(A 



( i i ) SEQUENCE DESCRIPTION: SBQ ID NO03: 
Ala Lei Gin Pro Oly Oly Tyt Cy» Gly Tyi Gly III Cyi 



) INFORMATION FOR SBQ ID NO:14: 



) SEQUENCE DESCRIPTION: SBQ ID NO: 14: 




5,800,988 

-continued 



( i i ) MOLECULE TYPE: peptide 

< i i ) FEATURE: 

( A ) NAME/KEY: Domain 

( B ) LOCATION: 1.18 

( D ) OTHER INFORMATION: /label=CDR3 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

Vil Pro Ala His Leu Gly Pro Gly 
1 5 

Lys Tyr Trp Gly Gin Gly Til r Gin 
2 0 

( 2 ) INFORMATION FOR SEQ ID NO: 16: 



) MOLECULE TYPE: per 



SEQIDNChlo: 
lit Ala Gly 




•P Tyr Trp 



Gly Gin Gly Thr Gl 



( 2 ) INFORMATION FOR SEQ ID NO: 18: 

( i ) SEQUENCE CHARACTERISTICS: 



( i i ) MOLECULE TYPE: peptide 



( I i ) SEQUENCE DESCRIPTION: SEQ ID NO-.18: 



5,800,988 

43 

-continued 



Asp Tip Ly s T,r Tr 
1 5 

Gin Tip Gly Gin Gl 
2 0 

( 2 ) INFORMATION FOR SEQ ID NO*.19: 

QUENCEC 
(A)U 



( D ) TOPOLOGY: fin 



(i i ) SEQUENCE DESCRIPTION: SBQ ID NO:lS: 

Atg Lea Thr Gin Mel Gly Ala Cy« A.p All Arg Trp All Thr Le 

1 5 10 15 

Thr Arg Thr Pit Ala Tyr An Tyr Trp Gly Ola Gly Thr Gin Va 



( 2 > INFORMATION FOR SBQ ID NO30: 

( i ) SEQUENCE CHARACTERISTIC 
( A ) LENGTH; 27 amino t 
( B ) TYPE: anno acid 

( D ) TOPOLOGY: Bacar 

( i i ) MOLECULE TYPE: peptide 



( 2 ) INFORMATION FOR SBQ ID N021: 



) FEATURE: 

( A )Nj 
( B ) LOCATION: 1-21 



Gly Set Thr Se 
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-continued 
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) INFORMATION FOR SEQ IE 



( D ) TOPOLOGY: linear 
) MOLECULE TYPE: peptide 



( X i ) SEQUENCE DESCRIPTION: SEQ ID N022: 

Ala Alp Pro S <s r lie Tyt Tyr Ser lie Leu Xaa lie G 1 u Tyr Ly» T, 
1 5 10 15 

Trp Ol j Ola Gly Thr Ola Val Thr Val Ser Ser 
) INFORMATION FOR SBQIDN033: 



( D ) TOPOLOGY: Hi 

( i i )M 
( i x ) FE 



( B ) LOCATION: 1-22 
( D ) OTHER INPORMAT1 

( X i ) SEQUENCE DESCRIPTION: SEQ ID N023: 

Alf Ser Pro Cyi Tyr Met Pro Thr Met Pro Ala Pro Pr 
1 5 10 

Ser Phe Gly Trp A.p A.p Phe Gly Ola Gly Thr Ola Va 



( 2 ) INFORMATION FOR SEQ ID N024: 



C i i ) MOLECULE TYPE: peptide 

( i x ) FEATURE: 

( A ) NAME/KEY: Domail 
( B ) LOCATION: 1.13 



) SEQUENCE DESCRIPTION: SFQ TD N024: 

r Ser Set Phe Tyr Trp Tyr Cy« Thr Thr Ala Pro Tyr Ana Val Tr 
5 10 15 

y Gla Gly Thr Gin Val Thr Val Ser Ser 
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-continued 
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( i i ) SEQUENCE DESCRIPTION: SEQ ID NO:25: 

III Gin lie Gin Tip Tyr Gly Cyi An Leu Arg Thr Thi P b e Hi 
1 5 10 15 

Tip Gly Oil Gly Thr Gin Val Tht Val Sei Ser 



( 2 ) [INFORMATION FOR SEQ ID NOSrj: 

( i ) SEQUENCE CHARACTERISTICS: 
( B ) TYPE: an 



( D ) TOPOLOGY: hx 
) MOLECULE TYPE: peptide 



n Oil Lei Al* Gly Gly Tip Tyi Leo Aip Pro An 
5 1 0 

.1 Gly All Tyi All lie Tip Gly Gil Gly Tht Hit 



( 2 ) INFORMATION FOR SBQ ID NO:27: 



( D ) TOPOLOGY: Been 
( • i ) MOLECULE TYPE: peptide 



( B ) LOCATION: 1-24 



(I i )! 

At| Leu Thr Ola Met Gly All Cyi Aip Ala Aig Tip All Thr Leo Ala 
1 5 10 15 

Thr Arg Thr Phe Ala Tyr Ail Tyr Tip Gly Aig Gly Thi Gin Val Thr 



FORMATION FOR SBQ ID N028: 
( i )SE 
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( D ) OTHER INFORMATION: /label=CH2 
) SEQUENCE DESCRIPTION: SEQ ID NO:31 : 



) INFORMATION FOR SEQ ID N032: 

( i ) SEQUENCE CHARACTERISTICS: 




( 2 ) INFORMATION FOR SBQ ID N033: 
( i )SBQJ 

( A )LE 
( B )TY 
( C )ST 
( D ) TOPOLOGY: fa 



(A) ] 

(B) l 
( D )< 



) INPORMAnON FOR SBQ ED N034: 



( D ) TOPOLOGY: in 
i i ) MOLECULE TYPE: peptide 



) SEQUENCE DESCRIPTION: SBQ ID N034: 

a Pro Ola Leu Leo G 1 jr Gly Pt 
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( 2 ) INFORMATION FOR SEQ ID NCH36: 



( B )TYPE;jmjK>.ei 
( C ) STRANDEDNE 
( D ) TOPOLOGY: tin 



( B ) LOCATION: 1..18 



( 2 ) INFORMATION FOR SEQ ID NOS7 : 



i ) MOLECULE TYPE: peptide 



) SEQUENCE DESCRIPTION: SEQ ID NO'37: 

y Thr Asn Gil Val Cy» Ly. Cyi Pro Lys Cy« 



( 2 ) INFORMATION FOR SEQ ID NOJ8: 



(B )T1 
(C )S1 
( D ) TOPOLOGY: till 



( i i ) MOLECULE TYPE: peptide 
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( 

( D ) OTHER INFORMATION: 

( i i ) SEQUENCE DESCRIPTION: SEQ ID NO:38: 

Glu Pro Ly> lie Pro Oil Pro Gin Pro Lys Pro Gin Pro Gin Pro Gl 
1 5 10 15 

Pro Gla Pro Lys Pro Gin Pro Lys Pro Glu Pro Gin Cys Thr Cys Pi 

2 0 25 3 0 

Lys Cys Pro 
3 5 

( 2 ) INFORMATION FOR SEQ ID N039: 



(B )1 
(C)S1 
( D ) TOPOLOGY: En 



( i i )M 
( i x )FE 

( B ) LOCATION: 1-28 



( x • )SE 

Ala Pro Glu Leu Leu Oly Oly Pro Ser Val Phe Val Phe Pro Pro Lyi 



( 2 ) INFORMATION FOR SEQ ID NO*0: 



( D ) TOPOLOGY: bnear 

( i i ) MOLECULE TYPE: peptide 

( i z ) FEATURE: 

( A ) NAME/KEY: Dcma 

( D ) OTHER INFORMATION: A 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO-.40: 

Ala Pro Ola Leu Pro Oly Oly 



( i ) 

( A ) LENGTH: 28 amino 
( B ) TYPE: annuo acid 



( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: peptide 

( i I ) FEATURE: 

( A ) NAME/KEY: Domain 

( B ) LOCATION: 1-28 

( D ) OTHER INFORMATION: rtabel=CH2 
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) INFORMATION FOR SEQ ID NO:42: 

( i ) SEQUENCE CHARACTERISTICS: 



( D ) TOPOLOGY: Be 
( i i ) MOLECULE TYPE: peptide 



Pbe Pro Pro Ly» 



( A)LI 
( B )T1 
(C)S1 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: peptide 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NOrtJ: 

Val Thi Val Set Set Gly Thl Am Gin Val Cji Lys 




G 1 y Gly Pi 
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( A ) NAME/KEY: Region 

( B ) LOCATION: 1..14 

( D ) OTHER INFORMATION: Aabel=fail]ge 

( i I ) FEATURE: 

( A ) NAME/KEY: Domain 

( B ) LOCATION: 1.14 

( D ) OTHER INFORMATION: /label=CH2 

( x i ) SEQUENCE DESCRIPTION: SBQ ID NO-.45: 

All Pro Oil Leu Pro Oly Oly Pro Ser V»l Phi V.l Pbe Pro 



( 2 ) INFORMATION FOR SBQ ID NO*6: 




Oly Oly Pr 



( D ) TOPOLOGY: b> 



( i i ) SEQUENCE DESCRIPTION: SEQ ID NO>47: 
COCCATCAAO OTAACAOTTO A 



( 2 ) INFORMATION FOR SBQ IDNOjtt: 
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( x i ) SEQUENCE DESCRIPTION: SEQ ID NO>»: 
AOCTCCAGCT OCTCGAOTCT OO 

( 2 ) INFORMATION FOR SEQ ID NO-JO: 



( D )TOPOLOOY:lii 



) DESaUPIlON: DNA (lyttfaak) 



(I i )SE 

AOOTCCAOCI TCTCOAGTCT OO 




( I i ) SEQUENCE DESCRIPTION: SEQ ID N031: 
TCTTAACTAG TGAGGAGACG GTGACCTO 

( 2 ) INFORMATION FOR SEQ ID N032: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 30 hue pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: in 



( i I ) FEATURE: 
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( x i ) SEQUENCE DESCRIPTION: SEQ ID N052: 
CTAGTGCACC ACCATCACCA TCACTAATA 

( 2 ) INFORMATION FOR SEQ ID NO:53: 



( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: sh 
( D ) TOPOLOGY: linear 



( B ) LOCATION: 1.30 



COTOOTQOT AOTOGTAGTG ATTATCTTA 



( 2 ) INFORMATION FOR SEQ ID NO _54 



( B ) TYPE: anno ac 
(C )STRANDEDNB! 
( D ) TOPOLOGY: lb 



( v ) FRAGMENT TYPE: N-tommial 
(r i )« 



) LOCATION: t-23 



( x i ) SEQUENCE DESCRIPTION: SBQ ID NO**: 

Leo Pro Gly Gly Pro Ser Val Phe Val Phe Pro Pro Ly« Pro Lyi A.p 

1 5 10 15 

V»l Leu Ser lie Xaa Gly Xaa Pro Ly« Oly Oil Thr Arg Gin Pro G 1 m 

Val Tyr Tbr Leo Ala Pro Xaa Arg Lea Glu Lea 
3 5 4 0 

( 2 ) INFORMATION FOR SEQ ID N035: 



( D ) TOPOLOGY: B« 
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( i i ) MOLECULE TYPE: peptide 

( v ) FRAGMENT TYPE: N-temnnal 
( v i ) ORIGINAL SOURCE: 

( i I ) FEATURE: 

( A ) NAME/KEY: Domain 
( B ) LOCATION: 1„24 
( D ) OTHER INFORMATION: flabefcCKL 
/ note= "Clone #72/1" 

( x i ) SEQUENCE DESCRIPTION: SEQ ID N035: 

Lea Pro Oly Oly Pro Ser Val Phe Val Phe Pro Thr Ly> Pro Lyi Aip 
1 5 10 15 



( 2 ) INFORMATION FOR SEQ ID 



( D )TOPOLOOY:Kn 



( v i ) ORIGINAL SOURCE: 

( A ) ORGANISM: Cmetas dramedarius 

( i x ) FEATURE: 

( A ) NAME/KEY: Domain 

( B ) LOCATION: 1M 

( D ) OTHER INFORMATION: /labd=CH2 

( I i ) SEQUENCE DESCRIPTION: SEQ ID NOS6: 

Loo Pro Gly Oly Pro Ser Val Phe Val Phe Pro Pro Lyi Pro Ly. Aip 
1 5 10 15 



( 2 ) INFORMATION FOR SEQ ID N037: 



( D ) TOPOLOGY: ha 



( A ) NAME/KEY: Domain 

( B ) LOCATION: 1_24 

( D ) OTHER INFORMATION: /label=CH2 

( i i ) SEQUENCE DESCRIPTION: SEQ ID NO:57: 

Leo Leo Gly Gly Pro Ser Val Phe lie Phe Pro Pro Lyi Pro Lyi A»p 



Val Leo Ser lie Ser Gly Arg Pro 
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( 2 ) INFORMATION FOR SEQ ID NO:58: 

( i ) SEQUENCE CHARACTERS 



( D ) TOPOLOGY: linear 
) MOLECULE TYPE: peptide 



) FEATURE: 

( A )!* 
( B ) LOCATION: 1 JO 



e r Va I Gly Ala Gly 
.p Set Phc Se, 



) SEQUENCE CHARACTERISTICS: 

( B ) TYPE: ammo acid 

( C ) STRANDBDNESS: Bogie 

( D ) TOPOLOGY: linear 

) MOLECULE TYPE: peptide 



<A)i 



( A)NJ 
( B ) LOCATION: 1. 
(D)OT 



( 2 ) INFORMATION FOR SBQ TO NO*!0: 



a I T h i Va 1 S e 



( A ) LENGTH 
( B ) TYPE: an 

(C)sr 

( D ) TOPOLOGY: lb 
) MOLECULE TYPE: peptide 
LL SOURCE: 



( A)l 
( B ) LOCATION: 1.11 
(D)« 



Tip Gly Ola Gly Tbr 



( 2 ) INFORMATION FOR SBQ ID NO*l: 
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( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 11 amino acids 

( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: peptide 

( i x ) FEATURE: 

( A ) NAME/KEY: Domain 
( B ) LOCATION: 1..11 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:61: 



) SEQUENCE CHARACTERISTICS: 
( B ) TYPE: amino acid 




Cji Gly Tyr 



( 2 ) INFORMATION FOR SEQ ID NOrf3: 



( A ) NAME/KEY: Region 

( B ) LOCATION: 1-12 

( D ) OTHER INFORMATION: AalxMVH 

( i x ) FEATURE: 

( A ) NAME/KEY: Domain 

( B ) LOCATION: 1-12 

( D ) OTHER INFORMATION: /labet=CDR3 

( I i ) SEQUENCE DESCRIPTION: SEQ ID NOSS3: 

Val Ser Leu Met Asp Arg lie Set Gin His Gly Cys 
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( D ) TOPOLOGY: lil 
( i i ) MOLECULE TYPE: protein 



) ORIGINAL SOURCE: 



( B ) LOCATION: 1_18 



( i i ) SEQUENCE DESCRIPTION; SEQ ID NO*4: 
Val Pro Ala His Leo Gly Pro Oly All lie Leu Asp Leu Lys Ly. Tyr 



( 2 ) INFORMATION FOR SEQ ID NCkfi5: 
( i )SE 



( D ) TOPOLOGY: nil 



( t i ) ORIGINAL SOURCE: 



( B ) LOCATION: 1-13 



( i X )FB» 

( B ) LOCATION: 1.1 J 



( X i ) SEQUENCE DESCRIPTION: SEQ ID NOxr5: 
Phe Cys Tyr Ser lb: AU Gly Asp Gly Gly Ser Oly Oil Met Ty 
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( 2 ) INFORMATION FOR SEQ ID NO:67: 

( i ) SEQUENCE CHARACTERISTICS: 

< A ) LENGTH: 17 amino adds 
( B ) TYPE: amino acid 

< C ) STRANDEDNESS: single 
( D ) TOPOLOGY: Enear 



( x i ) SEQUENCE DESCRIPTION: SEQ ID NO*7: 
Asp Tip Lys Tyr Trp Tor Cys Oly Ala Oil T h i Gly Oly Tyi Phc Gly 



) INFORMATION FOR SEQ ID NO:68: 



( i I ) FEATURE: 

( A)tt 
( B ) LOCATION: 1 -24 



( i X ) FEATURE: 

( A)N, 
( B ) LOCATION: 1-24 



( x i ) SEQUENCE DESCRIPTION: SEQ ID NO*8: 



( 2 ) INFORMATION FOR SEQ ID NO®: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 16 anno acids 
( B ) TYPE: amino acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 



) FEATURE: 

( B ) LOCATION: 1-16 



( i x )FI 

( A ) NAMBKEY: Domain 

( B ) LOCATION: 1.16 

( D > OTHER INFORMATION: rtabd=CDR3 
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) SEQUENCE DESCRIPTION: SEQ ID NO:®: 



( D ) TOPOLOGY: lii 
( i i ) MOLECULE TYPE: protein 



( B ) LOCATION: 1.- 
(D)a 

) FEATURE: 

( B ) LOCATION: 1_21 



lESCMPTION: SEQ ID NO-.70: 

r 8 Phe Stt Ser Pro V»l G 1 jr Sir Thr Set Arg Leu Oln St 



( 2 ) INFORMATION FOR SEQ ID NO:71: 

( i ) SEQUENCE CHARACTERISTICS: 



( A)N> 
(B)LC 
(D)Ol 

QUBNCE DESCRIPTION: SEQ ID NO-.71: 

Asp Pro Ser lie Ijr Tjr Ser lie Lea In lie Oln Iyr Ly> Tyr 
FOR SBQ ID NO:72: 



) LOCATION: 1.22 



( i i ) FEATURE: 

( B ) LOCATION: 1..22 
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( 2 )INK*MATIONFORSEQIDNO:73: 

( i ) SEQUENCE CHARACTBRI 
( A ) LENGTH: 15 an 



< D ) TOPOLOGY: En 
( i i ) MOLECULE TYPE: proton 



( i i ) SEQUENCE DESCRIPTION: SEQ ID NO:73: 
Thr Set Set Pbi Tyr Trp 



2 ) INFORMATION FOR SEQ ID NOl74: 



( B )TYPE:«m 
( C ) STRAND! 
( D ) TOPOLOGY: Hn 



( A ) NAME/KEY: Regie 
( B ) LOCATION: 1-16 
(D)O 

( i x ) FEATURE: 

( A ) NAME/KEY: Domam 

( B ) LOCATION: 1_1G 

( D ) OTHER INFORMATION: /tobel=CDR3 

( I i ) SEQUENCE DESCRIPTION: SEQ ID NO:74: 

Thr Olu lie Oln Trp Tjr Gly Cyi A s n Leo Arg Tbr Thr 
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) SEQUENCE DESCRIPTION: SEQ ID NO:75: 
B CU Leu Ala G 1 y Gly 



» a Ty r Tip Lt 



a 1 Oly Ala T y i 



( 2 ) INFORMATION FOR SEQ ID 



( B ) TYPE: m 
( C ) STRAND! 
( D ) TOPOLOGY: fit 

) MOLECULE TYPE: protein 



( B ) LOCATION: L 



<A)NJ 
( B ) LOCATION: 1..24 
( D ) OTHER INFORMATION: /labeKDR3 



Th r Oil M « 



( 2 ) INFORMATION POR SEQ ID NO:77: 
( i )SE 




( i i ) MOLECULE TYPE: proKill 



) OTHER INFORMATION: /labd=VH 



) SEQUENCE DESCRIPTION: SEQ ID NO:78: 



( 2 ) INFORMATION FOR SEQ ID NO:79: 




) SEQUENCE DESCRIPTION: SEQ ID NCW9: 



( 2 ) INFORMATION FOR SEQ IDNO:80: 

( > ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH 26 annuo acids 
( B ) TYPE: amino acid 

< C ) STRANDEDNESS: angle 

< D ) TOPOLOGY: Knaar 

( i • )M« 



) SEQUENCE DESCRIPTION: SEQ ID NO*): 
n Val Gin Leo Val Gin 



( 2 ) INFORMATION FOR SBQ ID NOS1: 



( B ) TYPE: amino ac 
( C ) STRANDEDNEv 
( D ) TOPOLOGY: tin 



) MOLECULE TYPE: protein 



83 



5,800,988 

-continued 



84 




( I i ) SEQUENCE DESCRIPTION: SBQ ID NO:85: 

Trp Phe Aig OH Gl y Pro Oly Lyi Oil Arg OH Gly lie Ala 
1 5 tO 

( 2 ) INFORMATION FOR SBQ ID NCK86: 

( i ) SEQUENCE CHARACTERISTICS: 
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) SEQUENCE DESCRIPTION: SEQ ID NO:86: 
g Phe III lie S e i Gin 



ip Til Oly III Tyr Tyi Cyi 



( 2 ) INFORMATION FOR SEQ ID NO:87: 



( A ) LENGTH: 69 amino a 
( C ) STRANDEDNESS: M 



( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:87: 
Tip Gly Oil Gly Thr Gin 



Gin Pro Gin Pi 



Ly. Pro Oil Pro OIu Cy. Thr Cy. Pro Ly» Cy. Pi 
3 S 4 0 4 5 

Gly Gly Pro Ser V»l Pho lie Phe Pro 



5 0 

( 2 ) INFORMATION FOR SEQ ID NOS8: 

( i ) SEQUENCE CHARACTERISTICS: 

( D ) TOPOLOGY: Enear ""^ 
( i i ) MOLECULE TYPE: proton 
( x i ) SEQUENCE DESCRIPTION: SEQ ID NO: 88: 



( 2 ) INFORMATION FOR SEQ ID NO*: 



( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:89: 
Tip Tyi Aig Gin Ala Pro 



( i ) SEQUENCE CHARACTERISTICS: 
( B ) TYPE: amino acid 



87 



5,800,988 

-continued 



88 



) MOLECULE TYPE: protein 



( 2 ) INFORMATION FOR SEQ ID NOS1: 



( D ) TOPOLOGY: fin 

( i i ) MOLECULE TYPE: prolan 

( i i ) SEQUENCE DESCRIPTION: SBQ ID N041: 

Trp Oly Oil Oly T h r Gin V*l T h r V»l Ser Ser Oly Tht 

1 3 10 

Cy» Ly« Cji Pfo L;i Cji Pro Ala Pro Oil Leu Pro Oly 

2 0 2 5 

V» 1 Pie V» 1 Phe Pro 

3 3 

) INFORMATION FOR SBQ ID NOr9!2: 

QUENCE CHARACTER1 
( A ) LENGTH: 4O0b 

( C )STRANDEDNE 
( D ) TOPOLOGY: Hi 



CTCGAGTCTO OGGOAOOATC GOTGCAOOCT GGAGGGTCTC TOAOACTCTC OIOCOCAOCC 

TCTGGATACA OTAATTGTCC CCTCACTTGG AOCTOOTATC GCCAGTTTCC AOGAACGGAG 

COCGAGTTCO TCTCCAGTAT OOATCCOOAT OOAAATACCA AOTACACATA CTCCOTGAAO 

OOCCOCTICA CCATGTCCCG AflOCAOCACC GAGTACACAG IATITCTOCA AATGGACAAT 

CTGAAACCTG AGGACACOOC GATGTATTAC TGTAAAACAG CCCTACAACC TOGGGGTTAT 

TOTOOGTATO GGTANTGCCT CIOOOOCCAO OOOACCCAOO TCACCOTCTC CTCACTAGTT 

ACCCGTACGA COTTCCGGAC IACOOIICTT AATAGAATTC 

( 2 ) INFORMATION FOR SEQ ID NO*): 



( D ) TOPOLOGY: BaM 
( i i ) MOLECULE TYPE: cDNA 



GCTC GGTGCAGGCT OOAOOOICTC TOAGACTCTC CTGTGCATCT 



ATATT GATGOTAAGA CAACATACOC AGACTCCGTG 
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AAGOGCCOAT TCACCATCTC CCAAGACAGC GCCAAGAACA CGGTGTATCT GCAGATGAAC 

AGCCTGAAAC CTOAGGACAC GGCGATGTAT TACTGTAAAA TAGATTCGTA CCCGTGCCAT 

CTCCTTGATG TCTGGGGCCA GGGGACCCAG GTCACCGTCT CCTCACTAGT TACCCGTACG 

AGCTTCCGGA CTACGGTTCT TAATAGAATT C 

( 2 ) INFORMATION FOR SEQ ID NO:94: 



( A ) LENGTH: 443 base pairs 
( B ) TYPE: nucleic arid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: cDNA 

( I i ) SEQUENCE DESCRIPTION: SEQ ID NOS4: 

CAGGTGAAAC TGCTCGAGTC TGGAGGAGGC TCGGTGCAGA CTOGAGGATC TCTOAGACTC 

TCCTGTOCAG TCTCTGGATT CTCCTTTAOT ACCAGTTGTA TOOCCIOOTT CCOCCAGGCT 

TCAGGAAAGC AGCGTGAOOG GGTCGCAGCC ATTAATAGTG GCGGTGGTAG GACATACTAC 

AACACATATG TCGCCGAOTC CGTGAAGGGC COATTCGCCA TCTCCCAAGA CAACOCCAAG 

ACCACGGTAT ATCTTGATAT GAACAACCTA ACCCCTGAAG ACACOGCTAC GTATTACTGT 

GCGQCGGTCC CAOCCCACTT GGGACCTGGC OCCAIICIIO ATTTGAAAAA GTATAAGTAC 

TGGGGCCAGG OGACCCAGGT CACCOTCTCC TCACTAGCTA GTTACCCGTA CGACGTTCCG 

GACTACGGTT CTTAATAGAA TTC 

( 2 ) INFORMATION FOR SEQ ID NOS5: 



) LENGTH: 433 base pai 



(zi)SE 

CTCGAGTCTG OGGGAGGGTC GOTOCAGOCT GGAGGGTCTC TGAGACTCTC CTGTAATGTC 

TCTGGCTCTC CCAGTAGTAC TTATTOCCTG OOCTOOIICC GCCAGOCTCC AOGGAGGGAG 

CGTGAGGGGG TCACAGCGAT TAACACTGAT GGCAOTATCA TATACOCAGC COACTCCOIO 

AAGOGCCOAT TCACCATCTC CCAAGACACC GCCAAGOAAA COOTACATCT CCAGATGAAC 

AACCTGCAAC CTGAGOATAC GOCCACCTAT TACTOCOCGO CAAGACTGAC GGAGATGGGG 

GCTTGTGATG CGAGATGGGC OACCTTAOCG ACAAOGACGT TTGCGTATAA CTACTGGGGC 

CGGGGGACCC AGGTCACCGT CTCCTCACTA GTTACCCGTA CGACGTTCCG GACTACGGTT 

CTTAATAGAA TTC 

( 2 ) INFORMATION FOR SEQ ID NOS6: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 449 base pars 

( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: cDNA 
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TGCTCGAGTC TGGGGGAGGG TCGGTGCAGG CTGG 



GCCOACTCCG TGAAGGGCCO ATTCACCATC TCCCAAOACA CCGCCAAGAA AACGGTATAT 

CTCCAGATGA ACAACCTGCA ACCTGAGGAT ACOOCCACCI ATTACTGCGC GGCAAGACTG 

ACGGAOATGG GGGCTTGTGA TGCGAGATGG OCGACCTTAG CG AC A AGO AC GTTTGCGTAT 

AACTACTGOG GCCGOGGGAC CCAOGTCACC GTCTCCTCAC TAGCTAGTTA CCCGTACGAC 

GTTCCOGACT ACGGTTCTTA ATAOAATTC 



) INFORMATION FOR SBQ ID NOS7: 




GCAGOCT OOAGOAICTC TGAGACTCTC CTGTOCAGCC 

CACQOOAIIC CGCTCAATOO TTACTACATC GCCTGGTTCC OTCAOOCTCC TGGGAAGGGG 

COIOAOOOOO TCGCAACAAT IAAIOOIOOI COCOACOICA CATACTACOC COACTCCOTG 

ACGOGCCGAT TTACCATCTC CCOAOACAGC CCCAAGAATA CGOTGTATCT GCAGATOAAC 

AGCCTGAAAC CTOAOOACAC OGCCATCTAC TTCTGTGCAO CAGGCTCGCO IIIITCTAOT 

CCTOITOOOA GCACTTCTAO ACTCOAAAOT AOCGACTATA ACIAIIOOOO CCAGOOGATC 

CAGOTCACCO TCACCTCACT AGTTACCCGT ACGACGTTCC OOACIACOOT TCTTAATAGA 
ATTC 

( 2 ) INFORMATION FOR SBQ ID NOS8: 




N: SBQIDN038: 

CTCOAGTCTG GAGOAGGCTC GOTTCAGOCT OGAGOOTCCC TTAGACTCTC CTGTOCAGCC 

TCTGACTACA CCATCACTGA TTATTGCATG GCCTGGTTCC OCCAOOCTCC AGGGAAGGAG 

CGTGAATTGG TCGCAGCGAT TCAAGTTGTC CGTAGTGATA CTCOCCTCAC AGACTACGCC 

GACTCCGTGA AOOOACOATT CACCATCTCC CAAOGCAACA CCAAOAACAC AGTGAATCTG 

CAAATGAACA OCCTGACACC TOAGGACACG GCCATCTACA GTTGTGCGOC AACCAGTAGT 

TTTTACTGGT ACTGCACCAC GOCGCCTTAT AACOTCTOOO GTCAGGGGAC CCAOGTCACC 

GTCTCCTCAC TAGTTACCCG TACGACGTTC CGGACTACOO TTCTTAATAG AATTC 

( 2 )1 
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( i i ) MOLECULE TYPE: cDNA 

(Hi) SEQUENCE DESCRIPTION: SEQ ID NOS9: 

CTCGAGTCTG GOGGAGGCTC GGTGCAGGGT GGAGGGTCTC TGAGACTCTC CTGTOCAATC 

TCTGGATACA CGTACGGTAG CTTCTGTATG GGCTGGTTCC GCGAGGGTCC AGGCAAGGAA 

CGTGAGGGGA TCOCAACTAT TCTTAATGGT GGTACTAACA CATACTATGC CGACTCGGTG 

AAGOGCCGAT TCACCATCTC CCAAGACAGC ACGTTGAAGA CGATGTATCT GCTAATGAAC 

AACCTGAAAC CTGAAGACAC GGGCACCTAT TACTGTGCTG CAGAACTAAG TGGTGGTAGT 

TGTGAATTGC CTTTGCTATT TGACTACTGG GGCCAGGGCA CCCAOGTCAC CGTCTCCTCA 

CTAGTTACCC OTACOACOTT CCGGACTACG OTTCTTAATA GAATTC 

( 2 ) INFORMATION FOR SEQ ID NO: 100: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 427 base pars 
( B ) TYPE: nucleic add 



( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: cDNA 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:100: 

CTCGAGTCTG GOGGAGGCTC OOTGCAGGCT GGAGGGTCTC TGAGACTCTC CTGTACAOOC 

TCTGGATTCC CCTATAGTAC CTTCTGTCTO OOOTOOTTCC GCCAGGCTCC AGGGAAGGAG 

CGTGAGOGGG TCOCOGOTAT TAATAGTGCA GGAOGTAATA C T T ACT A T GC COACOCCOTO 

AAGOGCCGAT TCACCATCTC CCAAGGGAAT G C C A AG A AT A COGTGTTTCT GCAAATOGAT 

AACTTGAAAC CTOAOGACAC OOCCATCTAT TACTGCGCGG COGATAOTCC ATGTTACATO 

CCOACTATGC CCGCTCCCCC GAT ACGAGAC AGTTTTGGCT GOOATGATTT TGGCCAGOOO 

ACCCAGGTCA CCOICICCIC ACTAGTTACC CGTACGACOT TCCOGACTAC GOTTCTTAAT 
AGAAT T C 



( D ) TOPOLOGY: Hi 



( i i ) SEQUENCE DESCRIPTION: SEQ ID NO:101: 

CTCOAGTCAO OGOGAGOCTC GOT AC AGOTT GGAGGGTCTC TGAGACTCTC CTOTGTAOCC 60 

TCTACTCACA CCGACAGTAG CACCTGTATA GGCTGGTTCC GCCAGGCTCC AOGOAAOOAO 120 

CGCGAGGGGG TCGCAAGTAT ATATTTTGGT GATGGTGGTA CGAATTATCG CGACTCCGTO 180 

AAGGGCCGAT TCACCATCTC CCAACTCAAC GCCCAGAACA CAGTGTATCT OCAAATGAAC 240 

AGCCTGAAAC CTGAGGACAG CGCCATGTAC TACTGTGCAA TCACTGAAAT TGAGTGGTAT 3 00 

GGGTGCAATT TAAGGACTAC TTTTACTCGC TOOGOCCAGG GOACCCAOGT CACCGTCTCC 360 

TCACTAGTTA CCCGTACGAC GTTCCGGACT ACGGTTCTTA ATAGAATTC 409 

( 2 ) INFORMATION FOR SBQ ID NO:102: 

( i ) SEQUENCE CHARACTERISTICS: 



95 



5,800,988 

-continued 



96 



) STRANDEDNESS: single 



( i i ) MOLECULE TYPE: cDNA 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:102: 

CTCOAOTCTG GOOOAOOCIC OGTACAAACT OGASOOTCTC TGAGACTCTC TTGCGAAATC 

TCTGGATTGA CTTTTGATOA TTCTGACGTG GGGTOGTACC OCCAOOCTCC AGGGGATGAG 

TGCAAATTOG TCTCAGOTAT TCTOAGTGAT QGTACTCCAT ATACAAAGAG TOOAGACTAT 

GCTGAGTCTG TGAGGGGCCG GGTTACCATC TCCAGAGACA ACGCCAAGAA CATGATATAC 

CTTCAAATGA ACGACCTGAA ACCTOAOGAC ACGGCCATGT ATTACTGCGC GGTAGATOGT 

TGGACCCOOA AGGAAGGOOG AATCGGGTTA CCCTOGTCGO TCCAATGTGA AGATGGTTAT 

AACTATTOGG GCCAGGGGAC CCAOflTCACC GTCTCCTCAC TAGTTACCCO TACOACGTTC 

CGGACTACGG TTCTTAATAG AATTC 

( 2 ) INFORMATION FOR SEQ ID NCM03: 

( A ) LENGTH: 394 bate pain 
( B ) TYPE: anckic add 
( C > STRANDEDNESS: .ingle 
( D ) TOPOLOGY: Haaar 

( i i ) MOLECULE TYPE: cDNA 

( i i ) SEQUENCE DESCRIPTION: SEQ ID NO:103: 

CTCGAGTCTG OAOOAOOCTC OOIOCAOOCI OOAOOOICTC TOAOACTCTC CTGTOTAOCC 

TCTGGATTCA AITTCOAAAC IICICOIATO OCOIOOIACC GCCAGACTCC AOGAAATGTO 

TOIOAOTIOO TCTCAAOTAT TTACAGTOAT GOCAAAACAT ACTACGTCGA CCGCATOAAO 

OOCCGATTCA CCATTTCTAG AGAGAATGCC AAGAATACAT TOTATCTACA ACTGAGCOOC 

CTCAAACCTG AOGACACOOC CATGTATTAC IOIOCOCCOO ITOAAIAICC TATTGCAGAC 

ATGTGTTCGA GATACGGCGA CCCOOOGACC CAOGTCACCG TCTCCTCACT AOTTACCCOT 

ACGACGAACC OOACTACGOT TCTTAATAGA ATTC 



( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 433 baaepakf 
( B ) TYPE: aoefaic add 



( D > TOPOLOGY: linear 

( i i ) MOLECULE TYPE: cDNA 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO: 104: 

CTCGAGTCTG OGGGAOGCTC GGTGCAGGTT OOAOOOTCTC TGAAACTCTC CTGTAAAATC 

TCTGGAOGTA CCCCAOATCG TOTTCCTAAA TCTTTGGCCT GGTTCCGCCA GGCTCCAGAG 

AAGGAOCGCG AGGGOATCOC AOTTCTTTCG ACT AAGGATG GTAAGACATT CTATOCCGAC 

TCCGTGAAGG OCCGATTCAC CATCTTCTTA GATAATGACA AGACCACTTT CTCCTTACAA 

CTTGATCGAC TGAACCCGGA OOACACTGCC G ACT ACT AC T GCGCTOCAAA TCAATTAGCT 

GGTGGCTGGT ATTTOGACCC GAATTACTGG CTCTCTGTGG GTGCATATGC CATCTGGGGC 

CAGGGGACCC AGGTCACCGT CTCCTCACTA OTTACCCOTA CGACGTTCCG GACTACGGTT 
CTTAATAGAA TTC 
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) INFORMATION FOR SEQ ID NO:105: 

( i > SEQUENCE CHARACTERISTICS: 



( i i ) MOLECULE TYPE: cDNA 

( x i ) SEQUENCE DESCRIPTION: SEQ TD NO:105: 
CAGGTGAAAC TGCTCGAGTC TGGGGOAGGC TCGOTOCAGG CTGGGGGGTC TCTGACA' 
TCTTGTGT AT ACACCAACGA TACTGGGACC ATGGGATOGT TTCGCCAGGC TCCAGGG. 
GAGTGCGAAA OGGTCGCOCA TATTACGCCT GATOGTATGA CCTTCATTGA TGAACCC< 



AGTCTGAGGC CTGAGGACAC GOCCGTOTAT TACTGTGCGG CAGATTGOAA ATACTGGACT 
TGTGGTGCCC AGACTGGAGG ATACTTCOGA CAGTOOGOTC AGGGGGCCCA GGTCACCGTC 
TCCTCACTAG CTAGTTACCC GTACGACGTT CCGOACTACG GTTCTTAATA GAATTC 

( 2 ) INFORMATION FOR SEQ ID NO:106: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 361 hue pars 
( B ) TYPE: nucleic add 
( C ) STRANDEDNBSS: angle 
( D ) TOPOLOGY: linear 



( x i ) SEQUENCE DESCRIPTION: SEQ ID NO: 106: 

CTCGAGTCTG GGGGAGGCTC GGTCCAACCT GOAOGATCTC TOACACTCIC CTGTACAGTT 

TCTGGGGCCA CCTACAGTGA CTACAOTATT OGATGGATCC OCCAOGCTCC AGGGAAGGAC 

CGTGAAGTAG TCOCAOCCGC TAATACTOGT GCOACTAGTA AATTCTACOT CGACTTTOTG 

AAGGGCCGAT TCACCATTTC CCAAGACAAC GCCAAGAATA COGTATATCT GCAAATGAGC 

TTCCTGAAAC CTGAGGACAC GOCCATCTAT TACTGTGCGG CAGCGGACCC AAGTATATAT 

TATAGTATCC TCCATTGAOT ATAAGTACTO OGOCCAGGOG ACCCAGGTCA CCOTCTCCTC 



(C) 

( i i (MOLECULE TYPE: cDNA 

(xi ) SEQUENCE DESCRIPTION: SEQ TD NO: 107: 

CTCGAGTCAG GGGGAGGCTC GGTGGAGGCT OGAGOOTCTC TGAGACTCTC CTGTACAGCC 60 

TCTGGATACG TATCCTCTAT GGCCTGGTTC COCCAGGTTC CAGGOCAGGA GCGCGAOGGG 120 

GTCGCGTTTG TTCAAACGGC TGACAATAGT GCATTATATG OCGACTCCOT OAAOGGCCGA 180 

TTCACCATCT CCCACGACAA CGCCAAGAAC ACOCTOTATC TOCAAATOCO CAACCTGCAA 240 

CCTGACGACA CTGGCGTGTA CTACTOTGCG OCCCAAAAGA AGGATCGTAC TAGATGGGCC 300 

GAGCCTCGAO AATGOAACAA CTGGGGCCAG GQQACCCAGG TCACCOTCTC CTCA 334 
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) INFORMATION FOR SEQ ID NO:108: 



( D ) TOPOLOGY: hmm 

( i i ) MOLECULE TYPE: cDNA 

( i i ) SEQUENCE DESCRIPTION: SEQ ID NO:108: 

CTCOAOTCAG OTGTCCGOTC TGATGTOCAG CTOGTGGCGT CTOOOOOAOO CTCGGTGCAG 

GCTGGAGGCT CTCTGAGACT CTCCTGTACA GCCTCTGGAO ACAOTTTCAG TAGATTTGCC 

ATOTCTTGGT TCCOCCAOOC TCCAOGOAAG GAOTGCGAAT TOOTCTCAAO CATTCAAAGT 

AATOOAAGGA CAACTGAOOC CGATTCCGTG CAAOGCCOAT TCACCATCTC CCOAGACAAT 

TCCAGGAACA CAGTGTATCT GCAAATGAAC AGCCTGAAAC CCOAGGACAC GGCCGTGTAT 

TACTGTGGGG CAOTCTCCCT AATGGACCGA ATTTCCCAAC ATGGOTOCCO GGGCCAGOGA 

ACCCAOGTCA CCGTCTCCTT A 



) INFORMATION FOR SEQ ID MO: 109: 




Loo Pro Pi 



( 2 ) INFORMATION FOR SBQ ID NO-.110: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 18 amino acidi 
( B ) TYPE: amino acid 
( D ) TOPOLOGY: haeac 

( i i ) MOLECULE TYPE: peptide 

(I i ) SEQUENCE DESCRIPTION: SBQ ID NO:llO: 

Gly Oil Pro Ar« Oil Pro Ola Val Tyr Thr Loo Pro Pro Ser Arg Glo 
1 5 10 IS 

Glo Met 




Gly Gli Pro Al| Glu Pro Gla Val Tyr Thr Lett Pro Pro Sor Ola Oil 
1 5 10 15 

Gin Met 



( 2 ) INFORMATION FOR SBQ ID NO:112: 
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( D ) TOPOLOGY: lii 




( i i ) MOLECULE TYPE: peptide 
( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:113: 
Val Ala Olj Pro Ser Val 



( 2 ) INFORMATION FOR SEQ ID NCfclH: 

( i ) SEQUENCE CHARACTERISTICS: 



< i i )M 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NChlM: 
Ph. Lea Oly Oly Pro Ser Val P h e 

Thi Leo Met lie Ser Arg Thr Pro 



) INFORMATION FOR SEQ ID NOU15: 



( D ) TOPOLOGY: Ira 



( i i ) SEQUENCE DESCRIPTION: SEQ ID NO: 11 V 

Lyi Oly Oil Pro Arg Ola Pro Ola Val Tyr Thr Leu Pro Pro Ser Arg 
1 5 10 15 
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O 1 u Pro Gin 



( 2 )INPORMAnONFORSEQIDNO:117: 



( A ) LENGTH: 19 am 



( i i > MOLECULE TYPE: peptide 

( X i ) SEQUENCE DESCRIPTION: SEQ ID NO:117: 

Ly« Gly Gin Pro Arg Gin Pro Gin Val Tyr T h r 
1 5 10 

Gin Oil Met 
( 2 ) INFORMATION FOR SEQ ID NO: 118: 



( B ) TYPE; anju acid 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: peptide 

( I i ) SEQUENCE DESCRIPTION: SBQ ID NO: 11 8: 

Oil Val Lyi Leu Val GIo Ser Gly Gly Gly Leo Val Olo Pro Oly Olj 
1 5 10 15 

Set Les Arg Len Ser Cyi Ala Tbr Ser Oly Pit Tbr Phe Ser 



(A)LE 
(B )Ti 
( D ) TOPOLOGY: linear 



al Oln Len Len Ser Oly Oly Oly 



( 2 ) INFORMATION FOR SEQ ID NO:120: 



( B ) TYPE: amino ac 
( D ) TOPOLOGY: hi 



) SEQUENCE DESCRIPTION: SEQ ID NO:120: 



( 2 ) INFORMATION FOR SEQ ID NO:121: 

( i ) SEQUENCE CHARACTERISTICS: 
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) SEQUENCE DESCRIPTION: SEQ ID NO:121: 



( 2 ) INFORMATION FOR SEQ ID NO:122: 



( i i ) MOLECULE TYPE: peptide 
( I i ) SEQUENCE DESCRIPTION: SEQ ID NO: 122: 
Trp Gly Oil Gly Thr Thr Leo Th 



( i i ) MOLECULE TYPE: peptide 
( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:123: 
Trp Oly Ola Gly Tht Set V.l Thr V» 1 Se 



( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 11 amino adds 
( B ) TYPE: amino acid 



( x i ) SEQUENCE DESCRIPTION: SEQ ID NO: 124: 
Tip Oly Ala Oly Tht Tht Val Th 



( 2 ) INFORMATION FOR SEQ ID NO:l25: 



( D ) TOPOLOGY: linear 



) SEQUENCE DESCRIPTION: SEQ ID NO: 123: 

p Tyt Tyt Gly Ser Ser Tyr Pbe Ajp Val Tip Gly Ala Oly Tht Tht 



( 2 ) INFORMATION FOR SEQ ID 



( A ) LENGTH: 67 amino adds 
( B ) TYPE: ammo acid 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: peptide 
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( D ) TOPOLOGY: lirear 
( i i ) MOLECULE TYPE: peptide 
( x i ) SEQUENCE DESCRIPTION: SEQ ID NO: 130: 
Tip Gly Arg Gly Tbi Leu Val Thr Vil Sei Ser 



We claim: 

1. A cDNA library comprising nucleotide sequences cod- 
ing for a heavy-chain immunoglobulin comprising two 
heavy polypeptide chains, each heavy chain consisting of a 15 
complete antigen binding site, said immunoglobulin con- 
taining a variable (V^) region and a constant region, said 
constant region being devoid of first constant domain C H l, 
wherein said immunoglobulin is devoid of polypeptide light 
chains, obtained by performing the following steps: 20 

(a) obtaining B lymphocytes from a biological sample 
containing lymphoid cells, wherein said biological 
sample is obtained from a Camelid; 

(b) separating polyadenylated RNA from other nucleic 25 
acids and components of the B lymphocytes; 

(c) reacting the obtained RNA with a reverse transcriptase 
in order to obtain the corresponding cDNA; 

(d) contacting the obtained cDNA with 5* primers corre- 
sponding to mouse \ H domain of four-chain 30 
immunoglobulins, which primer contains a determined 
restriction site, and with 3' primers corresponding to the 
N-terminal part of a C H 2 domain; 

(e) anrolifying the DNA; 

(f) cloning the amplified DNA in a vector; and 

(g) recovering the clones hybridizing with a probe corre- 
sponding to the sequence coding for a constant domain 
of an isolated heavy-chain immunoglobulin. 

2. A cDNA library comprising nucleotide sequences cod- m 
ing for a heavy-chain immunoglobulin comprising two 
heavy polypeptide chains, each heavy chain consisting of a 
complete antigen binding site, said immunoglobulin con- 
taining a variable (VJ region and a constant region, said 
constant region being devoid of first constant domain C^l, 45 
wherein said immunoglobulin is devoid of polypeptide light 
chains, obtained by performing the following steps: 



(a) obtaining B lymphocytes from a biological sample 
containing lymphoid cells, wherein said biological 
sample is obtained from a Camelid; 

(b) separating polyadenylated RNA from other nucleic 
acids and components of the B lymphocytes; 

(c) reacting the obtained RNA with a reverse transcriptase 
in order to obtain the corresponding cDNA; 

(d) contacting the obtained cDNA with 5' primers located 
in the promoter, leader or framework sequences of the 
V HH sequence of a heavy chain immunoglobulin, 
which primer contains a determined restriction site, and 
with the 3' primers located in (he hinge, C H 2, C^3, 3' 
untranslated region or polyA tail; 

(e) amplifying the DNA; 

(f) cloning the amplified DNA in a vector; and 

(g) recovering the obtained clones. 

3. The cDNA library according to claim 2, wherein the 
B-lymphocytes of step (a) are obtained from an animal 
previously immunized against a determined antigen and the 
clones recovered in step (g) encode polypeptide chains 
having a preselected specificity for the antigen used for 
immunization. 

4. The cDNA library according to claim 1 or claim 2, 
wherein said lymphoid cells are selected from the group 
consisting of peripheral lymphocytes, spleen cells, lymph 
nodes, and other lymphoid tissue. 

5. The cDNA library according to claim 1 or claim 2, 
wherein in step (d) said restriction site is an Xhol site. 

6. The cDNA library according to claim 1 or claim 2, 
wherein in step (f) said vector is a bluescript vector. 
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ABSTRACT 



There is provided an isolated immunoglobulin comprising 
two heavy polypeptide chains sufficient for the formation of 
a complete antigen binding site or several antigen binding 
sites, wherein the immunoglobulin is further devoid of light 
polypeptide chains. 
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DR010CI5 C TCGA6 — TCTGGGGGA6G 

DR2700I C TCGAG— TCTGGGGGAGG 

DR030C5 C AGGTGA AACTGCTCGAG TCTGGAGGAGG 

DR110C6 C TCGAG— TCTGGGGGAGG 

DR2i0C 5 C AGGTGA AACTGCTCGAG— TCTGGGGGAGG 

DR160C- C — - TCGAG— TCTGGAGGAGG 

DR1900 1 C TCGAG— TCTGGAGGAGG 

DR07005 C TCGAG— TCTGGGGGAGG 

BRI6OO0 C TCGAG— TCTGGGGGAGG 

DR20005 C TCGAG— TCAGGGGGAGG 

DR25005 C TCGAG— TCTGGGGGAGG 

DR20005 C TCGAG— TCTGGAGGAGG 

DR21005 C TCGAG— TCTGGGGGAGG 

DR09005 C AGGTGA AACTGCTCGAG — TCTGGGGGAGG 

DR17005 C TCGAG— TCTGGGGGAGG 

DR13005 C TCGAG— TCAGGGGGAGG 

DR02005 CTCGAGTCAGGTGTCCGGTCTGATGTGCAGCTGGTGGCGTCTGGGGGAGG 

DR01005 ATCGGTGCAGGCTGGAGGGTCTCTGAGACTCTC--GTGCG-CAG^ 

DR27006 CTCGGTGCAGGCTGGAGGGTCTCTGAGACTCTCCTGTGCATCTTCTTCT^ 

DRO3005 CTCGGTGCAGACTGGAGGATCTCTGAGACTCTCCTGTGCACT^ 

DR11006 GTCGGTGCAGGCTGGAGGGTCTCTGAGACTCTCCTGTAATGT-C-TCTG 

DR24006 GTCGGTGCAGGCTGGAGGGTCTCTGA6ACTCTCCTGTAATGT--C-TCT6 

DR16006 CTCGGCGCAGGCTGGAGGATCTCTGAGACTCTCCTGTGCAGC--CCACGG 

DR19006 CTCGGTTCAGGCTGGAGGGTCCCTTAGACTCTCCTGTGCAGC--C-TCTG 

DR07006 CTCGGTGCAGGGTGGAGGGTCTCTGAGACTCTCCTGTGCAA TCTCTG 

DR16006 CTCGGTGCAGGCTGGAGGGTCTCTGAGACTCTCCTGTACAG ££TCTG 

DR20006 CTCGGTACAGGTTGGAGGGTCTCTGAGACTCTCCTGTGTAG---CCTa 

DR25006 CTCGGTACAAACTGGAGGGTCTCTGAGACTCTCTTGCG— AAATCTCTG 

DR20006 CTCGGTGCAGGCTGGAGGGTCTCTGAGACTCTCCTGTG---TAGCCTCTG 

DR21006 CTCGGTGCAGGTTGGAGGGTCTCTGAAACTCJCCTGT^ 

DR09006 CTCGGTGCAGGCTGGGGGGTCTCTGACACTCTCTTGTG---TATACAC^ 

DR17006 CTCGGTCCAACCTGGAGGATCTCTGACACTCTCCTGTAW 

DR13006 CTCGGTGGAGGCTGGAGGGTCTCTGAGACTCTCCTGTACAG CCTCTG 

DR02006 CTCGGTGCAGGCTGGAGGCTCTCTGAGACTCTCCTGTACAG — CCTCTG 

DR01006 GA--TACAGTAATT — GTCCCCTCACTTG-GAGCTGGTATCGCCAGTTT 

DR27006 AA— TATATGCCTT — GCACCTACGACAT-GACCTGGTACCG^ 

DR03006 GA--TTCTCCTTTA---GTACCAGn6TAT-GGCCTGGnCCK 

DR11006 GC— TCTCCCAGTA — GTACTTATTGCCT-GGGCTGGTTCCGCCAGGCT 

DR24006 GC--TCTCCCAGTA — GTACTTATTGCCT-GGGCTGGTTCCGCCAGGCT 

DR16006 GA--TTCCGC-TCA — ATGGTTACTACAT-CGCCTGGTTCCGTCAGGCT 

DR19006 AC--TACACCATCA — CTGATTATTGCAT-GGCCTGGTTCCGCCAGGCT 

DR07006 GA--TACACGTACG — GTAGCTTCTGTAT-GGGCTGGTTCCGCGAGGGT 

DR16006 GA--TTCCCCTATA — GTACCTTCTGTCT-GGGGTGGTTCCGCCAGGCT 

DR20006 CT--CACACCGACA — GTAGCACCTGTAT-AGGCTGGnCCGCCAGG^ 

DR25006 GA--TTGACTTTTG — ATGATTCTGACGT-GGGGTGGTACCGCCAGGCT 

DR20006 GA--TTCAATTTCG — AAACTTCTCGTAT-GGCGTGGTACCGCCAGACT 

DR21006 GAGGTACCCCAGATCGTGTTCCTAAATCTTTGGCCTGGTTCCG^ 

DR09006 CAACGATACTGGGACCA TGGGATGGTTTCGCCAGGCT 

DR17006 "GGGCCACCTACA---GTGACTACAGTATTGGA-TC 

DR13006 G ATACGTAT-CCT CTATGGCCTGGTTCCGCCAGGTT 

DR02006 GAGA CAGTTTCAGTAGATT--TGCCATGTCTTGGTTCCGCCAGGCT 

FIG. 7 A 
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DR010C6 
DR270C6 
DR030C6 
DR110C6 
DR240C6 
DR160C6 
DR190C6 
DR07006 
DR160C6 
DR200C6 
DR250C6 
DR200C6 
DR210C6 
DR09006 
DR17006 
DR13006 
DR02006 

DR010C6 
DR27006 
DR030C6 
DR110C6 
DR240C6 
DR160D6 
DR190C6 
DR07006 
DR16006 
DR20006 
DR25006 
DR20006 
DR21006 
DR09006 
DR17006 
DR13006 
DR02006 

DR01006 
DR27006 
DR03006 
DR11006 
DR24006 
DR16006 
DR19006 
DR07006 
DR16006 
DR20006 
DR25006 
DR20006 
DR21006 
DR09006 
DR17006 
DR13006 
DR02006 



CCAGGAAC6GAGCGC6A6TTCGTCTCCAGTATGGATCCGGATGGAAATAC 



CCAGGCAAGGAGCGCGAATTTG' 
TCAGGAAAGCAGCGTGAGGGGG' 
CCAGGGAGGGAGCGTGAGGGGG' 
CCAGGGAAGGAGCGTGAGGGGG' 
CCTGGGAAGGGGCGTGAGGGGG' 
CCAGGGAAGGAGCGTGAATTGG' 



TCTCAAGTATAAATATTGATGGTAAGAC 
"CGCAGCCATTAATAGTGGCGGTGGTAG 

TCACAGCGATTAA CACTGATG6 

"CACAGCGATTAA CACTGATGG 

"CGCAACAATTAATGGTGGTCG 

^..,„ "CGCAGCGATTCAAGTTGTCCGTAGTGA 

CCAGGCAAGGAACGTGAGGGGATCGCAACTATTCTTAATGGTGGTACTAA 
CCAGGGAAGGAGCGTGAGGGGGTCGCGGGTATTAATAGTGCAGGAGGTAA 
CCAGGGAAGGAGCGCGAGGGGGTCGCAAGTATATATTTTGGTGATGGTGG 
CCAGGGCATGAGTGCAAATTGGTCTCAGGTATTCTGAGTGATGGTACT-C 
CCAGGAAATGTGTGTGAGTTGGTCTCAAGTATTTACAGTGATGG— --- 
CCAGAGAAGGAGCGCGAGGGGATCGCAGTTCTTTCGACTAAGGATGGTAA 
CCAGGGAAAGAGTGCGAAAGGGTCGCGCATATTACGCCTGATGGTATGA- 

CCAGGGAAGGACCGTGAAGTAGTCGCAGCCGCTAATACTGGTG 

CCAGGGCAGGAGCGCGAGGGGGTCGCGTTTGTTCAAACGG ---- 

CCAGGGAAGGAGTGCGAATTGGTCTCAAGCATTCAAAGTAATGGAAGGAC 

CAAGTACA CATACTCCGTGAAGGGCCGCTTCACC 

AACATACG CAGACTCCGTGAAGGGCCGATTCACC 

GACATACTA-CAACACATATGTCGCCGAGTCCGTGAAGGGCCGATTCGCC 

CAGTATCAT-ATACGCA GCCGACTCCGTGAAGGGCCGATTCACC 

CAGTGTCAT-ATACGCA GCCGACTCCGTGAAGGGCCGATTCACC 

CGA-CGTCACATACTACGCCGACTCCGTGACGGGCCGATTTACC 

TACT— -CGC-C-TCACAGACTACGCCGACTCCGTGAAGGGACGATTCACC 

CACATACTATGCCGACTCGGTGAAGGGCCGATTCACC 

TACTTACTATGCCGACGCCGTGAAGGGCCGATTCACC 

TACGAATTATCGCGACTCCGTGAAGGGCCGATTCACC 

CATATACAAAGAGTGGAGACTATGCTGAGTCT6TGAGGGGCCGGGTTACC 

CA-AAACATACTACGTCGACC--GCA TGAAGGGCCGATTCACC 

GA CATTCTATGCCGACTCCGTGAAGGGCCGATTCACC 

CCTTCATTGATGAACCCGTGAAGGGGCGATTCACG 

CGACTAGTAAATTCTACGTCGACTTTGTGAAGGGCCGATTCACC 

--CTGACAAT-AGTGCATTATATGGCGACTCCGTGAAGGGCCGATTCACC 
AACTGA GGCCGATTCCGTGCAAGGCCGATTCACC 

ATGTCCCGAGGCAGCACCGAGTACACAGTATTTCTGCAAATGGACAATCT 
ATCTCCCAAGACAGCGCCAAGAACACGGTGTATCTGCAGATGAACAGCCT 
ATCTCCCAAGACAACGCCAAGACCACGGTATATCTTGATATGAACAACCT 
ATCTCCCAAGACACCGCCAAGGAAACGGTACATCTCCAGATGAACAACCT 
ATCTCCCAAGACACCGCCAAGAAAACGGTATATCTCCAGATGAACAACCT 
ATCTCCCGAGACAGCCCCAAGAATACGGTGTATCTGCAGATGAACAGCCT 
ATCTCCCAAGGCAACACCAAGAACACAGTGAATCTGCAAATGAACAGCCT 
ATCTCCCAAGACAGCACGTTGAAGACGATGTATCTGCTAATGAACAACCT 
ATCTCCCAAGGGAATGCCAAGAATACGGTGTTTCTGCAAATGGATAACTT 
ATCTCCCAACTCAACGCCCAGAACACAGTGTATCTGCAAATGAACAGCCT 
ATCTCCAGAGACAACGCCAAGAACATGATATACCTTCAAATGAACGACCT 
ATTTCTAGAGAGAATGCCAAGAATACATTGTATCTACAACTGAGCGGCCT 
ATCTTCTTAGATAATGACAAGACCACTTTCTCCTTACAACTTGATCGACT 
ATCTCCCGAGACAACGCCCAGAAAACGTTGTCTTTGCGAATGAATAGTCT 
ATTTCCCAAGACAACGCCAAGAATACGGTATATCTGCAAATGAGCTTCCT 
ATCTCCCACGACAACGCCAAGAACACGCTGTATCTGCAAATGCGCAACCT 
ATCTCCCGAGACAATTCCAGGAACACAGTGTATCTGCAAATGAACAGCCT 

FIG. 7B 



U.S. Patent Nov. 24, 1998 Sheet 10 of 12 



5,840,526 



DR01006 GAAACCTGAGGACACGGCGATGTATTACTGTAAAAC-A — GCCCTAC-- 

DR27006 GAAACCTGAGGACACGGCGATGTATTACTGTAAAAT-A — GA--TTC-- 

DR05006 AACCCCTGAAGACACGGCTACGTATTACTGTGCGGCGG — TCCCAGCCC 

DR11006 GCAACCTGAGGATACGGCCACCTATTACTGCGCGGCAA — GACTGACGG 

DR24006 GCAACCTGAGGATACGGCCACCTATTACTGCGCGGCAA — GACTGACGG 

DR16006 GAAACCTGAGGACACGGCCATCTACTTCTGTGCAGCAG — G CTC 

DR19006 GACACCTGAGGACACGGCCATCTACAGTTGTGCGGCAA — C CAG 

DR07006 GAAACCTGAAGACACGGGCACCTATTACTGTGCTG-CA — GAACTAAGT 

DR16006 GAAACCTGAGGACACGGCCATCTATTACTGCGCGG-CG — GATAGTCCA 

DR20006 GAAACCTGAGGACAGCGCCATGTACTACTGTGCAATCA — CTGAAATTG 

DR25006 GAAACCTGAGGACACGGCCATGTATTACTGCGCGGTAGATGGTTGGACCC 

DR20006 CAAACCTGAGGACACGGCCATGTATTACTGTGCG CC 

DR21006 GAACCCGGAGGACACTGCCGACTACTACTGCGCTGCAAATCAATTAGC-- 

DR09006 GAGGCCTGAGGACACGGCCGTGTATTACTGTGCGGCAGATTG 

DR17006 GAAACCTGAGGACACGGCCATCTATTACTGTGCGGCAG CGGACCC 

DR13006 GCAACCTGACGACACTGGCGTGTACTACTGTGCGGCC CAA 

DR02006 GAAACCCGAGGACACGGCCGTGTATTACTGTGGGGCAGT 

DR01006 A-AC— CTGGGGGTTATTGTGGGTA- 

DR27006 GTAC-CCGTGCCATCTCCTTGATG- 

DR03006 ACTTGGGACCT GGCG-CCATT CTTGATTTG 

DR11006 AGATGGGGGCTTGTGATGCGAGATGGGCGACCTTAGC--GACAAGGAC-G 

DR2a006 AGATGGGGGCTTGTGATGCGAGATGGGCGACCTTAGC--GACAAGGAC-G 

DR16006 GCGTTTTT-CTAGTCCTGTTGGGAGCACTTC-TAGAC — TCGAAAGTAG 

DR19006 TAGTTTTTACTGGTACT GCAC C— ACG G 

DR07006 GGTGGTAGTTGTGAATTGC — CTTTGC TATTTGACTA 

DR16006 TGTTACATGCCGACTATGC— CCGCTCCCCCGATACGAGACAGTTTTGG 

DR20006 AGTGGTATGGGTGCAATTT — AAGGACTACTTTTACT — C G 

DR25006 GGAAGGAAG--GGGGAATCGGGTTAC CCTGGTCGGTCCAATGTGAA 

DR20006 GGTTGAA TATC CTATTGCAGAC--ATGTGTT 

DR21006 — TGGTGGCTGGTATT TGGACCCGAATTACTGG-CTCTCTGTG 

DR09006 — GAAATACTGGA CTTGTGGTGC--CCAGA-CTGG AG 

DR17006 AAGTATATATTATAGTATC CTCCNNAT 

DR13006 AAGAAGGATCGTA CTAGATGGGC CGAGCCT 

DR02006 CTCCCTAA— TGGACCGAATTTC 

DR01006 — TGGGTANTGCCTCTGGGGCCAGGGGACCCAGGTCACCGTCTCCTCACT 

DR27006 --T CTGGGGCCAGGGGACCCAGGTCACCGTCTCCTCACT 

DR03006 AAAAAGTATAAGTACTGGGGCCAGGGGACCCAGGTCACCGTCTCCTCACT 

DR11006 TTTGCGTATAACTACTGGGGCCGGGGGACCCAGGTCACCGTCTCCTCACT 

DR24006 TTTGCGTATAACTACTGGGGCCGGGGGACCCAGGTCACCGTCTCCTCACT 

DR16006 CGA-CT-ATAACTATTGGGGCCAGGGGATCCAGGTCACCGTCACCTCACT 

DR19006 CGC-CTTATAACGTCTGGGGTCAGGGGACCCAGGTCACCGTCTCCTCACT 

DR07006 CTGGG GCCAGGGCACCCAGGTCACCGTCTCCTCACT 

DR16006 CTGGGATGATTTT GGCCAGGGGACCCAGGTCACCGTCTCCTCACT 

DR20006 CTGGG- GCCAGGGGACCCAGGTCACCGTCTCCTCACT 

DR25006 GATGGTTATAACTATTGGGGCCAGGGGACCCAGGTCACCGTCTCCTCAC- 

DR20006 CGAGAT ACG — GCGACCCGGGGACCCAGGTCACCGTCTCCTCAC- 

DR21006 GGTGCATATGCCATCTGGGGCCAGGGGACCCAGGTCACCGTCTCCTCAC- 

DR09006 GATACTTCGGACAG-TGGGGTCAGGGGGCCCAGGTCACCGTCTCCTCACT 

DR17006 --TGAGTATAAGTACTGGGGCCAGGGGACCCAGGTCACCGTCTCCTCA-- 

DR13006 CGAGAATGGAACAACTGGGGCCAGGGGACCCAGGTCACCGTCTCCTCA-- 

DR02006 CCAACATGGG--TGCCGGGGCCAGGGAACCCAGGTCACCGTCTCCT 
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8B§JRBI TTACCCGTACGACGTTCCGGACTAC6GTTCTTAATAGAATTC 
^"TinKCCGTACGAGCTTCCGGACTACGGTTCTTAATAGAATTC 

SSiinS? ^CTAGTTACCCGTACGACGTTCCGGACTACGGTTCTTAATAGAATTC 
AG----TTACCCGTACGACGTTCCGGACTACGGTTCTTAATAGAATTC 

SSIonSc '"" {gttacccgtacgacgttccggactacggttcttaatagaattc 
BSSSSI — -agttacccgtacgacgttccggactacggttcttaatagaattc 

889£8B£ — iSJACCCGTACGACGTTCCGGACTACGGTTCTTAATAGAATTC 

n§9nnn^ {GTTACCCGTACGACGTTCCGGACTACGGTTCTTAATAGAATTC 

8K288£ — "AGTTACCCGTACGACGTTCCGGACTACGGTTCTTAATAGAATTC 

8S£ TJGTTACCCGTACGACGTTCCGGACTACGGTTCTTAATAGAATTC 

8B?98R£ JAGTTACCCGTACGACGAACCGGACTACGGTTCTTAATAGAATTC 

nSnonSt 7;:J^TTACCCGTACGACGTTCCGGACTACGGTTCTTAATAGAATTC 

DR17006 ^^ AGTTACCCGTACGACGTTCCGGACTA CGGTTCTTAATAGAATTC 

DR13006 --------~--"-"--""""ii"iiz::::::::"":: _ " 

DR02006 T jj 

FIG. 7D 
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This is a division of application Ser. No. 08/106,944, 
filed Aug. 17, 1993, now abandoned. 

The invention relates to new isolated immunoglobulins 
which are devoid of light polypeptide chains. These immu- 
noglobulins do not consist in the degradation products of 
immunoglobulins composed of both heavy polypeptide and 
light polypeptide chains but to the contrary, the invention 
defines a new member of the family of the 
immunoglobulins, especially a new type of molecules 
capable of being involved in the immune recognition. Such 
immunoglobulins can be used for several purposes, espe- 
cially for diagnosis or therapeutical purposes including 
protection against pathological agents or regulation of the 
expression or activity of proteins. 

Up to now the structure proposed for immunoglobulins 
consists of a four-chain model referring to the presence of 
two identical light polypeptide chains (light chains) and two 
identical heavy polypeptide chains (heavy chains) linked 
together by disulfide bonds to form a v- or T-shaped mac- 
romolecules. These chains are composed of a constant 
region and a variable region, the constant region being 
subdivided in several domains. The two heavy polypeptide 
chains are usually linked by disulphidc bounds in a so-called 
"hinge region" situated between the first and second 
domains of the constant region. 

Among the proteins forming the class of the 
immunoglobulins, most of them arc antibodies and accord- 
ingly present an antigen binding site or several antigen 
binding sites. 

According to the four-chain model, the antigen binding 
site of an antibody is located in the variable domains of each 
of the heavy and light chains, and requires the association of 
the heavy and the light chains variable domains. 

for the definition of these four-chain model 
immunoglobulins, reference is made to Roitt. I et al 
(Immunology-second-Edition Gower Medical Publishing 
U.S.A., 1989). Reference is especially made to the part 
concerning the definition of the four-chain 
immunoglobulins, their polypeptide and genetic structures, 
the definition of their variable and constant regions and the 
obtention of the fragments produced by enzymatic degra- 
dation according to well known techniques. 

The inventors have surprisingly established that different 
molecules can be isolated from animals which naturally 
produce them, which molecules have functional properties 
of immunoglobulins these functions being in some cases 
related to structural elements which are distinct from those 
involved in the fund inn of four-chain immunoglobulins due 
for instance to the absence of light chains. 

The invention relates to two-chain model immunoglo- 
bulins which neither correspond to fragments obtained for 
instance by the degradation in particular the enzymatic 
degradation of a natural four-chain model immunoglobulin, 
nor correspond to the expression in host cells, of DNA 
coding for the constant or the variable region of a natural 
four-chain model immunoglobulin or a part of these regions, 
nor correspond to antibodies produced in lymphopaties for 
example in mice, rats or human. 

E. S. Ward et al (1) have described some experiments 
performed on variable domains of heavy polypeptide chains 
(Vh) or / an d light polypeptide chains (V A yF,,) to test the 
ability of these variable domains, to bind specific antigens. 
For this purpose, a library of genes was prepared from 
the spleen genomic DNA of ice previously immunized with 
these specific antigens. 



Ward et al have described in their publication that V H 
domains are relatively sticky, presumably due to the exposed 
hydrophobic surface normally capped by the V A - or V x 
domains. They consequently envisage that it should be 
possible to design domains having improved properties 
and further that domains with binding activities could 
serve as the building blocks for making variable fragments 
(Fv fragments) or complete antibodies. 

The invention does not start from the idea that the 

J different fragments (light and heav\ chains) and the different 
domains of these fragments of four-chain model immuno- 
globulin can be modified to define new or improved antigen 
binding sites or a four-chain model immunoglobulin. 

The inventors have determined that immunoglobulins 

5 can have a different structure than the known four-chain 
model and that such different immunoglobulins offer new 
means for the preparation of diagnosis reagents, therapeu- 
tical agents or any other reagent for use in research or 
industrial purposes. 

D Thus the invention provides new immunoglobulins 
which are capable of showing functional properties of 
four-chain model immunoglobulins although their structure 
appears to be more appropriate in many circumstances for 
their use, their preparation and in some cases for their 

^ modification. Moreover these molecules can be considered 
as lead structures for the modification of other immunoglo- 
bulins. The advantages which are provided by these immu- 
noglobulins comprise the possibility to prepare them with an 
increased facility. 

The invention accordingly relates to immunoglobulins 
characterized in that they comprise two heavy polypeptide 
chains sufficient for the formation of a complete antigen 
binding site or several antigen binding sites, these immu- 
noglobulins being further devoid of light polypeptide chains. 

5 In a particular embodiment of the invention, these immu- 
noglobulins are further characterized by the fact that they are 
the product of the expression in a prokaryotic or in a 
eukaryotic host cell, of a DNA or of a cDNA having the 
sequence of an immunoglobulin devoid of light chains as 

3 obtainable from lymphocytes or other cells of Camelids. 
The immunoglobulins of the invention can be obtained 
for example from the sequences which are described in FIG. 



The immunoglobulins of the invention, which are devoid 
of fight chains are such that the variable domains of their 
heavy chains have properties differing from those of the 
four-chain immunoglobulin V,,. The variable domain of a 
heavy-chain immunoglobulin of the invention has no normal 
interaction sites with the V L or with the C H 1 domain which 
do not exist in the heavy chain immunoglobulins. It is hence 
a novel fragment in many of its properties such as solubility 
and position of the binding site. For clarity reasons we will 
call it W HH in this text to distinguish it from the classical 
of four-chain immunoglobulins. 

By "a complete antigen binding site" it is meant accord- 
ing to the invention, a site which will alone allow the 
recognition and complete binding of an antigen. This could 
be verified by any known method regarding the testing of the 
binding affinity. 

These immunoglobulins which can be prepared by the 
technique of recombinant DNA, or isolated from animals, 
will be sometimes called "heavy-chain immunoglobulins" in 
the following pages. In a preferred embodiment of the 
invention, these immunoglobulins are in a pure form. 

In a first embodiment, the immunoglobulins of the inven- 
tion are obtainable in prokaryotic cells, especially in E. coli 
cells by a process comprising the steps of: 



a) cloning in a Bluecript vector of a DNA or cDNA 
sequence coding for the V ff „ domain of an immuno- 
globulin devoid of light chain obtainable for instance 
from lymphocytes of Camelids, 

b) recovering the cloned fragment after amplification 5 
using a 5' primer containing an Xho site and a 3' primer 
containing the Spe site having the following sequence 
TC TTA ACT ACT GAG GAG ACG GTG ACC TG, 
SEQ ID NO: 51 

c) cloning the recovered fragment in phase in the immuno J() 
PBS vector after digestion of the vector with Xho and 
Spe restriction enzymes, 

d) transforming host cells, especially E. coli by transfec- 
tion with the recombinant immuno PBS vector of step 

c, 15 

e) recovering the expression product of the V HH coding 
sequence, for instance by using antibodies raised 
against the dromadary V HH domain. 

In another embodiment [lie immunoglobulins ai\ hctcro- 
specific immunoglobulins obtainable by a process compris- 20 
ing the steps of: 

obtaining a first DNA or cDNA sequence coding for a 
V HH domain or part thereof having a determined speci- 
ficity against a given antigen and comprised between 
Xho and Spe sites, 25 

obtaining a second DNA or cDNA sequence coding for a 
~V HH domain or part thereof, having a determined 
specificity different from the specificity of the first 
DNA or cDNA sequence and comprised between the 
Spe and EcoRI sites, 30 

digesting an immuno PBS vector with EcoRI and Xhol 
restriction enzymes, 

ligating the obtained DNA or cDNA sequences coding for 
V HH domains, so that the DNAor cl)N A sequences arc 
serially cloned in the vector, 

transforming a host cell, especially E. coli cell by 
transfection, and recovering the obtained immunoglo- 
bulins. 

In another embodiment, the immunoglobulins are obtain- ^ 
able by a process comprising the steps of: 

obtaining a DNA or cDNA sequence coiling for a \ lm 

domain or part thereof, having a determined specific 

antigen binding site, 
amplifying the obtained DNAor cDNA, using a 5' primer 4J 

containing an initiation codon and a Hindlll site, and a 

3' primer containing a termination codon having a Xhol 

site, 

recombining the amplified DNAor cDNAinto the Hindlll 
(position 2650) and Xhol (position 4067) sites of a 50 
plasmid pMM984, 

transfecting permissive cells especially NB-E cells with 
the recombinant plasmid, 

recovering the obtained products. 

Successful expression can be verified with antibodies 55 
directed against a region of a Y HH domain, especially by an 
ELISA assay. 

According to another particular embodiment of this 
process, the immunoglobulins are cloned in a parvovirus. 

In another example these immunoglobulins are obtainable 61 
by a process comprising the further cloning of a second 
DNAor cDNA sequence having another determined antigen 
binding site, in the pMM984 plasmid. 

Such an Immunoglobulin can be further characterized in 
that it is obtainable by a process wherein the vector is Yep 65 
52 and the transformed recombinant cell is a yeast especially 
S. cerevisiae. 



Aparticular Immunoglobulin is characterized in that it has 
a catalytic activity, especially in that it is directed against an 
antigen mimicking an activiated state of a given substrate. 
These catalytic antibodies can be modified at the level of 
their biding site, by random or directed mutagenesis in order 
to increase oe modify their catalytic function. Reference 
may be made to the publication of Lcrncr ct al (TIBS 
November 1987, 427-430) for the general technique for the 
preparation of such catalytic immunoglobulins. 

According to a preferred embodiment, the immunoglo- 
bulins of the invention are characterized in that their variable 
regions contain in position 45, an amino-acid which is 
different from leucine, proline or glutamine residue. 

Moreover the heavy-chain immunoglobulins are not prod- 
ucts characteristic of lymphocytes of animals nor from 
lymphocytes of a human patient suffering from lymphop- 
athies. Such immunoglobulins produced in lymphopathies 
are monoclonal in origin and result from pathogenic muta- 
tions at the genomic level. They have apparently no antigen 

The two heavy polypeptide chains of these immunoglo- 
bulins can be linked by a hinge region according to the 
definition of Roitt ct al. 

In a particular embodiment of the invention, immunoglo- 
bulins corresponding to the above -defined molecules are 
capable of acting as antibodies. 

The antigen binding site(s) of the immunoglobulins of the 
invention are located in the variable region of the heavy 

In a particular group of these immunoglobulins each 
heavy polypeptide chain contains one antigen binding site 
on its variable region, and these sites correspond to the same 



In a further embodiment of the ii 
globulins are characterized in that their heavy polypeptide 
chains contain a variable region (V^) and a constant region 
(C^) according to the definition of Roitt et al, but are devoid 
of the first domain of their constant region. This first domain 
of the constant region is called C^l. 

These immunoglobulins having no C^l domain are such 
that the variable region of their chains is directly linked to 
the hinge region at the ("-terminal part of the variable region. 

The immunoglobulins of the type described here-above 
can comprise type G immunoglobulins and especially 
immunoglobulins w hich arc defined as immunoglobulins of 
class 2 (lgG2) or immunoglobulins of class 3 (IgG3) 

The absence of the light chain and of the first constant 
domain lead to a modification of the nomenclature of the 
immunoglobulin fragments obtained by enzymatic 
digestion, according to Roitt et al. 

The terms Fc and pFc on the one hand, Fc' and pFc' on the 
other hand corresponding respectively to the papain and 
pepsin digestion fragments are maintained. 

The terms Fab F(ab) 2 F(ab') 2 Fabc, Fd and Fv are no 
longer applicable in their original sense as these fragments 
have either a light chain, the variable part of the light chain 
or the C^l domain. 

The fragments obtained by papain digestion and com- 
posed of the V HH domain and the hinge region will be called 
FV^h or F(V /;;/ !i) depending upon whether or not they 
remain linked by the disulphide bonds. 

In another embodiment of the invention, immunoglobu- 
lins replying to the hereabove given definitions can be 
originating from animals especially from animals of the 
camelid family. The inventors have found out that the 
heavy-chain immunoglobulins which are present in cam- 
elids are not associated with a pathological situation which 



would induce the production of abnormal antibodies with 
respect to the four-chain immunoglobulins. On the basis of 
a comparative study of old world camelids ( Camelus bac- 
trianus and Camelus dnmuiclcrms) and new world camelids 
(for example Lama paccos, Lama glama, and Lama ■ 
vicugna) the inventors have shown that the immunoglobu- 
lins of the invention, which are devoid of light polypeptide 
chains are found in all species. Nevertheless differences may 
be apparent in molecular weight of these immunoglobulins 
depending on the animals. Especially the molecular weight 
of a heavy chain contained in these immunoglobulins can be 
from approximately 43 kd to approximately 47 kd, in 
particular 45 kd. 

Advantageously the heavy-chain immunoglobulins of the ^ 
invention are secreted in blood of camelids. 

Immunoglobulins according to this particular embodi- 
ment of the invention are obtainable by purification from 
serum of camelids and a process for the purification is 
described in details in the examples. In the case where the 2 
immunoglobulins are obtained from Camelids, the invention 
relates to immunoglobulins which are not in their natural 
biological environment. 

According to the invention immunoglobulin IgG2 as 
obtainable by purification from the serum of camelids can be 2 
characterized in that: 

it is not adsorbed by chromatography on Protein G 
Sepharose column, 

it is adsorbed by chromatography on Protein A Sepharose : 

column, 

it has a molecular weight of around 10(1 kd after elution 
with a pH 4.5 buffer (0.15M NaCl, 0.58% acetic acid 
adjusted to pH 4.5 by NaOH), 
it consists of heavy |!2 polypeptide chains of a molecular ; 

weight of around 46 kd preferably 45 after reduction. 
According to a further embodiment of the invention 
another group of immunoglobulins corresponding to IgG3, 
as obtainable by purification from the serum of Camelids is 
characterized in that the immunoglobulin: 4 
is adsorbed by chromatography on a Protein A Sepharose 
column, 

has a molecular weight of around 100 kd after elution with 

a pH 3.5 buffer (0.15M NaCl, 0.58% acetic acid), 4 
is adsorbed by chromatography 011 a Protein G Sepharose 
column and eluted with pH 3.5 buffer (0.15M NaCl, 
0.58% acetic acid), 
consists of heavy y3 polypeptide chains of a molecular 
weight of around 45 Kd" in pan icnlar between 4^ and 47 = 
kd after reduction. 
The immunoglobulins of the invention which are devoid 
of light chains, nevertheless comprise on their heavy chains 
a constant region and a variable region. The constant region 
comprises different domains. 5 

The variable region of immunoglobulins of the invention 
comprises frameworks (FW) and complementarity deter- 
mining regions (CDR), especially 4 frameworks and 3 
complementarity regions. It is distinguished from the four- 
chain immunoglobulins especially by the fact that this t 
variable region can itself contain an antigen binding site or 
several, without contribution of the variable region of a light 
chain which is absent. 

The amino-acid sequences of frameworks 1 and 4 com- 
prise among others respectively amino-acid sequences e 
which can be selected from the following for the framework 
1 domain 



GGSVQTGGSLRLSCEISGLTFD SEQ ID NO:l 
GGSVQTGGSLRLSCAVSGFSFS SEQ ID NO:2 
GGSEQGGGSLRLSCAISGYTYG SEQ ID NO:3 
GGSVQPGGSLTLSCTVSGATYS SEQ ID NO:4 
GGSVQAGGSLRLSCTGSGFPYS SEQ ID NO:5 
GGSVQAGGSLRLSCVAGFGTS SEQ ID NO:6 



for the framework 4 domain 



WGQGTQVTVSS SEQ ID NO:8 
W G Q G T L V T V S S SEQ ID NO:9 
W G Q G A Q V T V S S SEQ ID NO:10 
WGQGTQVTASS SEQ ID NO:ll 
RGQGTQVTVSL SEQ ID NO:12 



for the CDR3 domain 



ALQPGGYCGYGX CLSEQIDNO:62 

VSLMDK1SQH GC SEQ ID NO:63 

VPAHLGPGAILDLKKY K Y SEQ ID NO:64 

FCYSTAGDGGSGE MY SEQ ID \0:65 

ELSGGSCELPLLF DY SEQ [D NO:66 

DWKYWTCGAQTGGYF GQ SEQ ID NO:67 

RLTEKGACDARWATLATRTFAYNY SEQ ID NO:68 

QKKDRTRWAEPREW N N SEQ ID NO:69 

GSRFSSPVGSTSRLES-SDY--NY SEQ ID NO:70 

A D P S I Y Y S I T. X I F. Y K Y SF.Q ID NO:71 

DSPCYMPTMPAPPIRDSFGW--DD SEQ ID NO:72 

TSSFYWYCTTAPY NV SEQ ID NO:73 

TEIEWYGCNLRTTF T R SEQ ID NO:74 

NQLAGGWYLDPNYWLSVGAY--AI SEQ ID >JO:75 
RLTEMGACDAR WATLATRTFAYNYSEQ ID NO:76 
DGWTRKEG G IG LPWSVQCEDGYNY SEQ ID NO:77 

DSYPCHLL DV SEQ ID NO:78 

VEYPIADMCS RY SEQ ID NO:79 



As stated above, the immunoglobulins of the ii 
are preferably devoid of the totality of their C^l domain. 

Such immunoglobulins comprise C H 2 and C^3 domains 
in the C-terminal region with respect to the hinge region. 

According to a particular embodiment of the invention the 
constant region of the immunoglobulins comprises C H 2 and 
C^3 domains comprising an amino-acid sequence selected 
from the following for the C H 2 domain: 



APELLGGPTVFIFPPKPKDVLSITLTP SEQ ID NO:3t 
APELPGGPSVFVFPTKPKDVLSISGRP SEQ ID NO:32 
APELPGGPSVFVFPPKPKDVLSISGRP SEQ ID NO:33 
APELLGGPSVFIFPPKPKDVLSISGRP SEQ ID NO:34 



for the C^3 domain: 



GQTREPQVYTLA SEQ ID NO:35 
GQTREPQVYTLAPXRLEL SEQ ID NO:36 
GQPREPQVYTLPPSRDEL SEQ ID NO:109 
GQPREPQVYTLPPSREEM SEQ ID NO: 110 
GQPREPQVYTLPPSQEEM SEQ ID NO:lll 

Interestingly the inventors have shown that the hinge 
region of the immunoglobulins of the invention can present 
variable lengths. When these immunoglobulins act as 
antibodies, the length of the hinge region will participate to 
the determination of the distance separating the antigen 
binding sites. 

Preferably an immunoglobulin according to the invention 
is characterized in that its hinge region comprises from 0 to 
50 amino-acids. 



Particular sequences of hinge region of the ir 
bulins of the invention are the following. 



EPKIPQPQPKPQPQPQPQPKPQPKPEPECTCPKCP SEQ ID NO:38 

The short hinge region cor responds to an IgG3 molecule 
and the long hinge sequence corresponds to an IgG2 mol- 

Isolated V HH derived from heavy chain immunoglobulins 

or V,„, libraries corresponding to the heavy chain ii u- 

noglobulins can be distinguished from V HH cloning of 
four-chain model immunoglobulins on the basis of sequence 
features characterizing heavy chain immunoglobulins. 

The camel heavy-chain immunoglobulin V HH region 
shows a number of differences with the V HH regions derived 
from 4-chnin immunoglobulins from all species examined. 
At the levels of the residues involved in the V HH fV L 
interactions, an important difference is noted at the level of 
position 45 (FW) which is practically always leucine in the 
4-chain immunoglobulins (98%), the other amino acids at 
this position being proline (f %) or glutamine (1%). 

In the camel heavy-chain immunoglobulin, in the 
sequences examined at present, leucine at position 45 is only 
found once. It could originate from a four-chain immuno- 
globulin. In the other cases, it is replaced by arginine, 
cysteine or glutamic acid residue. The presence of charged 
amino acids at this position should contribute to making the 
more soluble. 

The replacement by camelid specific residues such as 
those of position 45 appears to be interesting for the con- 
struction of engineered V HH regions derived from the V HH 
repertoire of 4-chain immunoglobulins. 

A second feature specific of the camelid V HH domain is 
the frequent presence of a cysteine in the CDR 3 region 
associated with a cysteine in the CDR 1 position 31 or 33 or 
I'W, region al position 45 The possibility of establishing a 
disulphide bond between the CDR 3 region and the rest of the 
variable domain would contribute to the stability and posi- 
tioning of the binding site. 

With the exception of a single pathogenic myeloma 
protein (DAW) such a disulphide bond has never been 
encountered in immunoglobulin V regions derived from 4 
chain immunoglobulins. 

The heavy-chain immunoglobulins of the invention have 
further the particular advantage of being not sticky. Accord- 
ingly these immunoglobulins being present in the scrum, 
aggregate much less than isolated heavy chains of a four- 
chain immunoglobulins. The immunoglobulins of the inven- 
tion are soluble to a concentration above 0.5 mg/ml, pref- 
erably above 1 mg/ml and more advantageously above 2 

These immunoglobulins further bear an extensive antigen 
binding repertoire and undergo affinity and specificity matu- 
ration in vivo. Accordingly they allow the isolation and the 
preparation of antibodies having defined specificity, regard- 
ing determined antigens. 

Another interesting property of the immunoglobulins of 
the invention is that they can be modified and especially 
humanized. Especially it is possible to replace all or part of 
the constant region of these immunoglobulins by all or part 
of a constant region of a human antibody. For example the 
C H 2 and/or C H 3 domains of the immunoglobulin could be 
replaced by the C H 2 and or C H 3 domains of the IgG y3 
human immunoglobulin. 

In such humanized antibodies it is also possible to replace 
a part of the variable sequence, namely one or more of the 



framework residues which do not intervene in the binding 
site by human framework residues, or by a part of a human 
antibody. 

Conversely features (especially peptide fragments) of 
heavy-chain immunoglobulin V HH regions, could be intro- 
duced into the or V £ regions derived from four-chain 
immunoglobulins with for instance the aim of achieving 
greater solubility of the immunoglobulins. 

The invention further relates to a fragment of an immu- 
! noglobulin which has been described hereabove and espe- 
cially to a fragment selected from the following group: 
a fragment corresponding to one heavy polypeptide chain 

of an immunoglobulin devoid of light chains, 
fragments obtained by enzymatic digestion of the immu- 
' noglobulins of the invention, especially those obtained 
by partial digestion with papain leading to the Fc 
fragment (constant fragment) and leading to FV^h 
fragment (containing the antigen binding sites of the 
heavy chains) or its dimer F(V HH h) 2 , or a fragment 
1 obtained by further digestion with papain of the Fc 
fragment, leading to the pFc fragment corresponding to 
the C-lerminal part of the Fc fragment, 



r pro 



*>lytic 



a fragment of at least 10 preferably 20 amino acids of the 
variable region of the immunoglobulin, or the complete 
variable region, especially a fragment corresponding to 
the isolated \ lm domains or to the V /r/r dimcrs linked 
to the hinge disulphide, 

a fragment corresponding to the hinge region of the 
immunoglobulin, or to at least 6 amino acids of this 
hinge region, 

a fragment of the hinge region comprising a repeated 
sequence of Pro-X, 

a fragment corresponding to at least 10 preferably 20 
amino acids of the constant region or to the complete 
constant region of the immunoglobulin. 

The invention also relates to a fragment comprising a 
repeated sequence, Pro-X which repeated sequence contains 
at least 3 repeats of Pro-X, X being any amino-ackl and 
preferably Gin (glutamine), I as (lysine) or Glu (acide 
glutamique); a particular repeated fragment is composed of 
a 12-fold repeat of the sequence Pro-X. 

Such a fragment can be advantageously used as a link 
between different types of molecules. 

The amino- acids ol the Pro-X sequence are chosen among 
any natural or non natural amino-acids. 

The fragments can be obtained by enzymatic degradation 
of the immunoglobulins. They can also be obtained by 
expression in cells or organisms, of nucleotide sequence 
coding for the immunoglobulins, or they can be chemically 
synthesized. 

The invention also relates to anti-idiotypes antibodies 
belonging to the heavy chain immunoglobulin classes. Such 
anti-idiotypes can be produced against human or animal 
idiotypes. A property of these anti-idiotypes is that they can 
be used as idiotypic vaccines, in particular for vaccination 
against glycoproteins or glycolipids and where the carbo- 
hydrate determines the epitope. 

The invention also relates to anti-idiotypes capable of 
recognizing idiotypes of heavy-chain immunoglobulins. 

Such anti-idiotype antibodies can be either syngeneic 
antibodies or allogenic or xenogeneic antibodies. 

The invention also concerns nucleotide sequences coding 
for all or part of a protein which amino-acid sequence 
comprises a peptide sequence selected from the following: 
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GGSVQTGGSLRLSCEISGLTFD SEQ ID NO:l 
GGSVQTGGSLRLSCAVSGFSFS SEQ ID NO:2 
GGSEQGGGSLRLSCAISGYTYC SEQ ID NO:3 
GGSVQPGGSLTLSCTVSGATYS SEQ ID NO:4 
GGSVQAGGSLRLSCTGSGFPYS SEQ ID NO:S 
GGSVQAGGSLRLSCVAGFGTS SEQ ID NO:6 
G G S V Q A G G S I. R I S C V S F S P S S SEQ ID NO:7 
WGQGTQVTVSS SEQ ID NO:8 
WGQGTLVTVSS SEQ ID NO:9 
WGQGAQVTVSS SEQ ID NO:10 
WGQGTQVTASS SEQ ID NO:ll 
RGQGTQVTVSL SEQ ID NO:12 



FCYSTAGDGGSGE M Y SEQ II) NO:c? 

ELSGGSCELPLLF DY SEQ ID NO:66 

DWKYWTCGAQTGGYF G Q SEQ ID NO:67 

RLTEMGACDARWAT LAT R T F A Y N Y SEQ ID NO:68 

QKKDRTRWAEPREW N N SEQ ID NO:69 

GSRFSSPVGSTSRLES--SDY-NY SEQ ID NO:70 

ADPSIYYSILXIEY K Y SEQ ID NO:71 

DSPCYMPTMPAPPIRDSFGW--DD SEQ ID NO:72 

TSSFYWYCTTAPY NV SEQ ID NO:73 

TEIEWYGCNLRTTF TR SEQ ID NO:74 

NQLAGGWYLDPNYWLSVGAY - - A I SEQ ID NO:7S 
RLTEMGACDARWATLATRTFAYNY SEQ ID NO:76 
DGWTRKEGGIGLPWSVQCEDGYNY SEQ ID NO:77 

DSYPCHLL DV SEQ ID NO:78 

VEYPIADMCS RY SEQ ID NO:79 

APELLGGFSVFVFPPKPKDVLSISGXPK SEQ ID NO:39 
APELPGGPSVFVFPTKPKDVLSISGRPK SEQ ID NO:40 
APELPGGPSVFVFPPKPKDVLSISGRPK SEQ ID NO:41 
APELLGGPSVFIFPPKPKDVLSISGRPK SEQ ID NO:42 
GQTREPQVYTLAPXRLEL SEQ ID NO:36 
OQPREPQVYl'LPFSRDEL SEQ ID NQ:1U9 
GQPREPQVYTLPPSREEM SEQ ID NO:110 
GQPREPQVYTLPPSQEEM SEQ ID NO:lll 

VSSGTNEVCKCPKCPAPELPGGPSVFVFP SEQ ID NO:43 

VTVSSEPKIPQPQPKPQPQPQPQPKPQPKPEPECTCPKCPAPELLGGPSVFIFP SEQ ID NO:44 
GTNEVCKCPKCP SEQ ID NO:37 
APELPGGPSVIV1 l> M O II) \Q:45 

M'KII'(JI'QrKI>()l>(.)l>()IV)l'KI>()l>KI>l PI < T< l>K( l> sliQ II) 
APELLGGPSVFIFP SEQ ID NO:46 



Such nucleotide sequences can be deduced from the 
amino-acid sequences taking into account the deneneracy of 
the genetic code. They can be synthesized or isolated from 
cells producing immunoglobulins of the invention. 

A procedure for the obtention of such DNA sequences is 
described in the examples. 

The invention also contemplates RNA, especially mRNA 
sequences corresponding to these DNA sequences, and also 
corresponding cDNA sequences. 

The nucleotide sequences of the invention can further be 
used for the preparation of primers appropriate for the 
detection in cells or screening of DNA or cDNA libraries to 
isolate nucleotide sequences coding for immunoglobulins of 
the invention. 

Such nucleotide sequences can be used for the preparation 
of recombinant vectors and the expression of these 
sequences contained in the vectors by host cells especially 
prokaryotic cells like bacteria or also eukaryotic cells and 
for example CHO cells, inseel cells, simian cells like Vero 
cells, or any other mammalian cells. Especially the fact that 
the immunoglobulins of the invention are devoid of light 
chains permits to secrete them in eukaryotic cells since there 
is no need to have recourse to the step consisting in the 
formation of the BIP protein which is required in the 
four-chain immunoglobulins. 

The inadequacies of the known methods for producing 
monoclonal antibodies or immunoglobulins by recombinant 



DNA technology conies from the necessity in the vast 
majority of cases to clone simultaneously the and Y L 
domains corresponding to the specific binding site of 4 chain 
immunoglobulins. The animals and especially camelids 
which produce heavy-chain immunoglobulins according to 
the invention, and possibly other vertebrate species are 
capable of producing heavy-chain immunoglobulins of 
which the binding site is located exclusively in the V nH 
domain. Unlike the few heavy-chain immunoglobulins pro- 
duced in other species by chain separation or by direct 
cloning, the camelid heavy-chain immunoglobulins have 
undergone extensive maturation in vivo. Moreover their V 
region has naturally evolved to function in absence of the 
W L . They are therefore ideal for producing monoclonal 
antibodies by recombinant DNA technology. As the obten- 
tion of specific antigen binding clones does not depend on a 
stochastic process necessitating a very large number of 
recombinant cells, this allows also a much more extensive 
examination of the repertoire. 

This can be done at the level of the non rearranged V HH 
repertoire using DNA derived from an arbitrarily chosen 
tissue or cell type or at the level of the rearranged V HH 
repertoire, using DNA obtained from B lymphocytes. More 
interesting however is to transcribe the mRNA from anti- 
body producing cells and to clone the cDNA with or without 
prior amplification into an adequate vector. This will result 
in the obtention of antibodies which have already undergone 
affinity maturation. 
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Hie examination of a huge repertoire should prove to be 
particularly useful in the search for antibodies with catalytic 
activities. 

The invention thus provides libraries which can be gen- 
erated in a way which includes part of the hinge sequence, 
the identification is simple as the hinge is directly attached 
to the y H[f domain. 

These libraries can be obtained by cloning cDNA from 
lymphoid cells with or without prim PCR amplification. The 
PGR primers are located in the promoter, leader or frame- 
work sequences of the V ///7 for the 5' primer and in the 
hinge, CH 2 , CH 3 , 3' untranslated region or polyAtail for the 
3' primer. A size selection of amplified material allows the 
construction of a library limited to heavy chain immuno- 
globulins. 

In a particular example, the following 3' primer in which 
a Kpnl site has been constructed and which corresponds to 
amino-acids 313 to 319 (CGC CAT CAA GOT AAC AGT 
TGA)(SEQ ID NO:47) is used in conjunction with mouse 
V HH primers described by Sestry et al and containing a Xho 



AO Cl'C CAO CIO C t'C GAO TCT GAG TCI GO SEQ ID NO:48 
AG CTC CAG CTG CTC GAG TCT GAG TCT GG SEQ [D NO:49 
AG GTC CAG CTT CTC GAG TCT GAG TCT GG SEQ ID NO:50 



These primers yield a library of camelid heavy chain 
immunoglobulins comprising the V HH region (related to 
mouse or human subgroup III), the hinge and a section of 
CH 2 . 

In another example, the cDNA is polyadenylated at its 5' 
end and the mouse specific V HH primers are replaced by a 
poly T primer with an inbuilt Xhol site, at the level of 
nucleotide 12. 



CTCGAGT 12 . 

The same 3' primer with a Kpnl site is used. 

This method generates a library containing all subgroups 
of immunoglobulins. 

Part of the interest in cloning a region encompassing the 
hinge-CH 2 link is that in both y2 and y3, a Sac site is present 
immediately after the hinge. This site allows the grafting of 
the sequence coding for the W HH and the hinge onto the Fc 
region of other immunoglobulins, in particular the human 
IgGj and IgG 3 which have the same amino acid sequence at 
this site (Glu 2 „ 6 Leu 2 , 17 ). 

As an example, the invention contemplates a cDNA 
library composed of nucleotide sequences coding for a 
heavy-chain immunoglobulin, such as obtained by perform- 
ing the following steps: 

a) treating a sample containing lymphoid cells, especially 
periferal, lymphocytes, spleen cells, lymph nodes or 
another Typhoid tissue from a healthy animal, espe- 
cially selected among the Camelids, in order to separate 
the lymphoid cells, 

b) separating polyadenylated RNAfrom the other nucleic 
acids and components of the cells, 

c) reacting the obtained RNA with a reverse transcriptase 
in order to obtain the corresponding cDNA, 

d) contacting the cDNA of step c) with 5' primers corre- 
sponding to mouse domain of four-chain 
immunoglobulins, which primer contains a determined 
restriction site, for example an Xhol site and with 3' 
primers corresponding to the N-terminal part of a C H 2 
domain containing a Kpnl site, 
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e) amplifying the DNA, 

f) cloning the amplified sequence in a vector, especially in 
a bluescript vector, 

g) recovering the clones hybridizing with a probe corre- 
5 sponding to the sequence coding for a constant domain 

from an isolated heavy-chain immunoglobulin. 
This cloning gives rise to clones containing DNA 
sequences including the sequence coding for the hinge. It 
thus permits the characterization of the subclass of the 
10 immunoglobulin and the SacI site useful for grafting the 
FV^ to the Fc region. 

The recovery of the sequences coding for the heavy-chain 
immunoglobulins can also be achieved by the selection of 
clones containing DNA sequences having a size compatible 
with the lack of the C^l domain. 

It is possible according to another embodiment of the 
invention, to add the following steps between steps c) and d) 
of the above process: 

in the presence of a DNA polymerase and of deoxyribo- 
nucleotide triphosphates, contacting said cDNA with 
20 oligonucleotide degenerated primers, which sequences 
are capable of coding for the hinge region and 
N-terminal W HH domain of an immunoglobulin, the 
primers being capable of hybridizing with the cDNA 
and capable of initiating the extension of a DNA 
25 sequence complementary to the cDNA used as 
template, 
recovering the amplified DNA. 

The clones can be expressed in several types of expres- 
sion vectors. As an example using a commercially available 

30 vector Immuno PBS (Huse et al: Science (1989) 246, 1275), 
clones produced in Bluescript® according to the above 
described procedure, are recovered by PCR using the same 
Xhol containing 5' primer and a new 3' primer, correspond- 
ing to residues 113-103 in the framework of the 

35 immunoglobulins, in which an Spe site has been con- 
structed: TC TTA ACT AGT GAG GAG ACG GTG ACC 
TG (SEQ ID NO:51). This procedure allows the cloning of 
the \ HH in the Xho/Spc site of the Immuno PBS vector. 
However, the 3' end of the gene is not in phase with the 

40 identification "tag" and the stop codon of the vector. To 
achieve this, the construct is cut with Spe and the 4 base 
overhangs are filled in, using the Klenow fragment after 
which the vector is religated. A further refinement consists 
in replacing the marker ("tag") with a poly hist id i nc so that 

45 metal purification of the cloned V HH can be performed. To 
achieve this a Spe LcoRI double stranded oligonucleotide 
coding for 6 histidines and a termination codon is first 
constructed In synthesis of both strands followed by heating 
and annealing: 



CTACTG CAC CAC CAT CAC CAT CAC TAA* TAG* SEQ ID NO:52 
AC GTG GTG GTA GTG GTA GTG ATT A TCTTAA SEQ ID NO:53 



55 The vector containing the insert is then digested with Spel 
and EcoRI to remove the resident "tag" sequence which can 
be replaced by the poly-His/termination sequence. The pro- 
duced Y HH can equally be detected by using antibodies 
raised against the dromedary V HH regions. Under laboratory 

60 conditions. V ;/;/ regions arc produced in the Immuno PBS 

The invention also relates to a DNA library composed of 

immunoglobulin, such as obtained from cells with rear- 
65 ranged immunoglobulin genes. 

In a preferred embodiment of the invention, the library is 
prepared from cells from an animal previously immunized 
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against a determined antigen. This allows the selection of 
antibodies having a preselected specificity for the antigen 
used for immunization. 

In another embodiment of the invention, the amplification 
of the cDNA is not performed prior to the cloning of the 
cDNA. 

The heavy-chain of the four-chain immunoglobulins 
remains sequestered in the cell by a chaperon protein (BIP) 
until il has combined w ill) a light chain. The binding site for 
the chaperon protein is the C„l domain. As this domain is 
absent from the heavy chain immunoglobulins, their secre- 
tion is independent of the presence of the BIP protein or of 
the light chain. Moreover tile inventors have shown that tile 
obtained immunoglobulins are not sticky and accordingly 
will not abnormally aggregate. 

The invention also relates to a process for the preparation 
of a monoclonal antibody directed against a determined 
antigen, the antigen binding site of the antibody consisting 
of heavy polypeptide chains and which antibody is further 
devoid of light polypeptide chains, which process com- 

immortalizing lymphocytes, obtained for example from 
the peripheral blood of Camelids previously immu- 
nized with a determined antigen, with an immortal cell 
and preferably with myeloma cells, in order to form a 
hybridoma, 

cull tiring the immortalized ceils (hybridoma) formed and 
recovering the cells producing the antibodies having 
the desired specificity. 

The preparation of antibodies can also be performed 
without a previous immunization of Camelids. 

According to another process for the preparation of 
antibodies, the recourse to the technique of the hybridoma 
cell is not required. 

According to such process, antibodies are prepared in 
vitro and they can be obtained by a process comprising the 
steps of: 

cloning into vectors, especially into phages and more 
particularly filamentous bacteriophages, DNA or 
cDNA sequences obtained from lymphocytes espe- 
cially PBLs of Camelids previously immunized with 
determined antigens, 

transforming prokaryotic cells with the above vectors in 
conditions allowing the production of the antibodies, 

selecting the antibodies for their heavy-chain structure 
and further by subjecting them to antigen-affinity 
selection, 

recovering the antibodies having the desired specificity, 
In another embodiment of the invention the cloning is 
performed in vectors, especially into plasmids coding for 
bacterial membrane proteins. Procaryotic cells arc then 
transformed with the above vectors in conditions allowing 
the expression of antibodies in their membrane. 

The positive cells are further selected by antigen affinity 
selection. 

The heavy chain antibodies which do not contain the C^l 
domain present a distinct advantage in this respect. Indeed, 
the C^l domain binds to BIP type chaperone proteins 
present within eukaryotic vectors and the heavy chains are 
not transported out of the endocytoplasmic reticulum unless 
light chains are pre sent. This means that in eukaryotic cells, 
efficient cloning of 4-chain immunoglobulins in non mam- 
malian cells such as yeast cells can depend on the properties 
of the resident BIP type chaperone and can hence be very 
difficult to achieve. In this respect the heavy chain antibodies 
of the invention which lack the C H1 domain present a 
distinctive advantage. 
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In a preferred embodiment of the invention the cloning 
can be performed in yeast either for the production of 
antibodies or for the modification of the metabolism of the 
yeast. As example, Yep 52 vector can be used. This vector 

5 has the origin of replication (ORI) 2ft of the yeast together 
with a selection marker Leu 2. 

The cloned gene is under the control of gall promoter and 
accordingly is inducible by galactose. Moreover, the expres- 
sion can be repressed by glucose which allows the obtention 

10 of very high concentration of cells before the induction. 
The cloning between BamHI and Sail sites using the same 
strategy of production of genes by PCR as the one described 
above, allows the cloning of camelid immunoglobulin genes 
in E. coli. As example of metabolic modulation which can 

15 be obtained by antibodies and proposed for the yeast, one 
can site the cloning of antibodies directed against cyclins. 
that is proteins involved in the regulation of the cellular 
cycle of the yeast (TIBS 16 430 J. D. McKinney, N. Heintz 
1991). Another example is the introduction by genetic 

20 engineering of an antibody directed against CD, S , which 
antibody would be inducible (lor instance by gall), within 
the genome of the yeast. The CD 28 is involved at the level 
of the initiation of cell division, and therefore the expression 
of antibodies against this molecule would allow an efficient 

25 control of multiplication of the cells and the optimization of 
methods for the production in bioreactors or by means of 
immobilized cells. 

In yet another embodiment of the invention, the cloning 
vector is a plasmid or a eukaryotic virus vector and the cells 

30 to be transformed are eukaryotic cells, especially yeast cells, 
mammalian cells for example CHO cells or simian cells 
such as Vero cells, insect cells, plant cells, or protozoan 
cells. 

For more details concerning the procedure to he applied 

35 in such a case, reference is made to the publication of Marks 
et al, J. Mol. Biol. 1991, 222:581-597. 

Furthermore, starting from the immunoglobulins of the 
invention, or from fragments thereof, new immunoglobulins 
or derivatives can be prepared. 

40 Accordingly immunoglobulins replying to the above 
given definitions can be prepared against determined anti- 
gens. Especially the invention provides monoclonal or poly- 
clonal antibodies devoid of light polypeptide chains or 
antisera containing such antibodies and directed against 

45 determined antigens and for example against antigens of 
pathological agents such as bacteria, viruses or parasites As 
example of an! [gens or am igcilic determinants against w hich 
antibodies could be prepared, one can cite the envelope 
glycoproteins of viruses or peptides thereof, such as the 

50 external envelope glycoprotein of a HIV vims, the surface 
antigen of the hepatitis B virus. 

Immunoglobulins of the invention can also be directed 
against a protein, hapten, carbohydrate or nucleic acid. 
Particular antibodies according to the invention are 

55 directed against the galactosyl^- 1 -3-galactose epitope. 

The immunoglobulins of the invention allow further the 
preparation of combined products such as the combination 
of the heavy-chain immunoglobulin or a fragment thereof 
with a toxin, an enzyme, a drug, a hormone. 

60 As example one can prepare the combination of a heavy- 
chain immunoglobulin bearing an antigen binding site rec- 
ognizing a myeloma immunoglobulin epitope with the abrin 
or mistletoe lectin toxin. Such a construct would have its 
uses in patient specific therapy. 

65 Another advantageous combination is that one can pre- 
pare between a heavy-chain immunoglobulins recognizing 
an insect gut antigen v>. ith a toxin specific for insects such as 
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the toxins of the different serotypes of Bacillus thuringiensis 
or Bacillus sphaericus. Such a construct cloned into plants 
can be used to increase the specificity or the host range of 
existing bacterial toxins. 

The invention also proposes antibodies having different 
specificities on each heavy polypeptide chains. These 
multifunctional, especially bifunctional antibodies could be 
prepared by combining two heavy chains of immunoglobu- 
lins of the invention or one heavy chain of an immunoglo- 
bulin of the invention with a fragment of a four-chain model 
immunoglobulin. 

The invention also provides hetero-specific antibodies 
which can be used for the targetting of drugs or any 
biological substance like hormones. In particular they can be 
used to selectively target hormones or cytokines to a limited 
category of cells. Examples are a combination of a murine 
or human antibody raised against inicrlcukin 2 (II.,) and a 
heavy-chain antibody raised against CD., cells. This could be 
used to reactivate CD 4 cells which have lost their IL 2 

The heavy-chain immunoglobulins of the invention can 
also be used for the preparation of hetero-specific antibodies. 
These can be achieved either according to the above 
described method by reduction of the bridges between the 
different chains and reoxydation, according to the usual 
techniques, of two antibodies having different specificities, 
but it can also be achieved by serial cloning of two anti- 
bodies for instance in the Immuno pBS vector. 

In such a case, a first gene corresponding to the V HH 
domain comprised between Xho site and a Spe site is 
prepared as described above. A second gene is then prepared 
through an analogous way by using as 5' extremity a primer 
containing a Spe site, and as 3' extremity a primer containing 
a termination codon and an EcoRI site. The vector is then 
digested with EcoRI and Xhol and further both V HH genes 
are digested respectively by Xho/Spe and by Spe/EcoRI. 

After ligation, both immunoglobulin genes are serially 
cloned. The spacing between both genes can be increased by 
the introduction of addition codons within the 5' Spel primer. 

In a particular embodiment of the invention, the hinge 
region of IgG2 immunoglobulins according to the invention 
is semi-rigid and is thus appropriate for coupling proteins. In 
such an application proteins or peptides can be linked to 
various substances, especially to ligands through the hinge 
region used as spacer. Advantageously the fragment com- 
prises at least 6 amino acids. 

According to the invention it is interesting to use a 
sequence comprising a repeated sequence Pro-X, X being 
any amino-acid and preferably Gin, Lys or Glu, especially a 
fragment composed of al leasl a 3-fold repeal and preferably 
of a 12-fold repeat, for coupling proteins to ligand, or for 
assembling different protein domains. 

The hinge region or a fragment thereof can also be used 
for coupling proteins to ligands or for assembling different 
protein domains. 

Usual techniques for the coupling are appropriate and 
especially reference may be made to the technique of protein 
engineering by assembling cloned sequences. 

The antibodies according to this invention could be used 
as reagents for the diagnosis in vitro or by imaging tech- 
niques. The immunoglobulins of the invention could be 
labelled with radio-isotopes, chemical or enzymatic markers 
or chemiluminescent markers. 

As example and especially in the case of detection or 
observation with the immunoglobulins by imaging 
techniques, a label like technetium, especially technitium 99 
is advantageous. This label can be used for direct labelling 
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by a coupling procedure with the immunoglobulins or 
fragments thereof or for indirect labelling after a step of 
preparation of a complex with the technitium. 

Other interesting radioactive labels are for instance 
; indium and especially indium ill, or iodine, especially I 131 , 
I 125 and I 123 . 

For the description of these techniques reference is made 
to the FR patent application published under number 
2649488. 

1Q In these applications the small size of the Y HH fragment 
is a definitive advantage for penetration into tissue. 

The invention also concerns monoclonal antibodies react- 
ing with anti-idiotypes of the above-described antibodies. 

The invention also concerns cells or organisms in which 
heavy-chain immunoglobulins have been cloned. Such cells 

15 or organisms can be used for the purpose of producing 
heavy-chain immunoglobulins having a desired preselected 
specificity, or corresponding to a particular repertoire. They 
can also be produced for the purpose of modifying the 
metabolism of the cell which expresses them. In the case of 

20 modification of the metabolism of cells transformed with the 
sequences coding for heavy-chain immunoglobulins, these 
produced heavy-chain immunoglobulins are used like anti- 
sense DNA. Antisense DNAis usually involved in blocking 
the expression of certain genes such as for instance the 

25 variable surface antigen of trypanosomes or other patho- 
gens. Likewise, the production or the activity of certain 
proteins or enzymes could be inhibited by expressing anti- 
bodies against this protein or enzyme within the same cell. 
The invention also relates to a modified 4-chain immu- 

30 noglobulin or fragments thereof, the Y H regions of which 

acids of heavy chain immunoglobulins, especially by 
sequences of the V Hfl domain. A particular modified 
domain of a four-chain immunoglobulin, is characterized in 

35 that the leucine, proline or glutamine in position 45 of the 
V H regions has been replaced by other amino acids and 
preferably by arginine, glutamic acid or cysteine. 

A further modified or V L domain of a four-chain 
immunoglobulin, is characlcri/cd by linking of CUR loops 

40 together or to FW regions by the introduction of paired 
cysteines, the CDR region being selected between the CDR 1 
and the CDR 3 , the FW region being the FW : region, and 
especially in which one of the cysteines introduced is in 
position 31, 33 of the CDR ± or 45 of FW 2 and the other in 

45 CDR 3 . 

Especially the introduction of paired cysteines is such that 
the CUR 3 loop is linked to the FW2 or CDR1 domain and 
more especially the cysteine of the CDR3 of the is linked 
to a cysteine in position 31, 33 of CUR, or in position 45 of 
so FW 2 . 

In another embodiment of the invention, plant cells can be 
modified by the heavy-chain immunoglobulins according to 
the invention, in order that they acquire new properties or 
increased properties. 
55 The heavy-chain immunoglobulins of the invention can 
be used for gene therapy of cancer for instance by using 
antibodies directed against proteins present on the tumor 
cells. 

In such a case, the expression of one or two Y HH genes 
60 can be obtained by using vectors derived from parvo or 
adeno viruses. The parvo viruses are characterized by the 
fact that they are devoid of pathogenicity or almost not 
pathogenic for normal human cells and by the fact that they 
are capable of easily multiplying in cancer cells (Russel S. 
65 J. 1990, Immunol. Today II. 196-200). 

The heavy-chain immunoglobulins are for instance 
cloned within HindlH/Xbal sites of the infectious plasmid of 
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the murine MVM virus (pMM984). (Merchlinsky et al, 
1983, J. Virol. 47, 227-232) and then placed under the 
control ol the MVM38 promoter. 

The gene of the V HH domain is amplified by PCR by 
using a 5' primer containing an initiation union and a 5 
Hindlll site, the 3' primer containing a termination codon 
and a Xbal site. 

This construct is then inserted between positions 2650 
(Hindlll) and 4067 (Xbal) within the plasmid. 

The efficiency of the cloning can be checked by transfec- w 
tion. The vector containing the antibody is then introduced 
in permissive cells (NB-E) by transfection. 

The culls arc recovered after two days and the presence of 
V HH regions is determined with an ELISA assay by using 
rabbit antiserum reacting with the V /7 „ part. 

The invention further allows the preparation of catalytic 15 
antibodies through different ways. The production of anti- 
bodies directed against components mimicking activated 
states of substrates (as example vanadate as component 
mimicking the activated state of phosphate in order to 
produce their phosphoesterase activities, phosphonate as 20 
compound mimicking the peplidic binding in order to pro- 
duce proteases) permits to obtain antibodies having a cata- 
lytic function. Another way to obtain such antibodies con- 
sists in performing a random mutagenesis in clones of 
antibodies for example by PCR, in introducing abnormal 25 
bases during the amplification of clones. These amplified 
fragments obtained by PCR are then introduced within an 
appropriate vector lor cloning Their expression at the 
surface of the bacteria permits the detection by the substrate 
of clones having the enzymatic activity. These two 30 
approaches can of course be combined. Finally, on the basis 
of the data available on the structure, for example the data 
obtained by XRay crystallography or NMR, the modifica- 
tions can be directed. These modifications can be performed 
by usual techniques of genetic engineering or by complete 35 
synthesis. One advantage of the W HH of the heavy chain 
immunoglobulins of the invention is the fact that they are 
sufficiently soluble. 

The heavy chain immunoglobulins of the invention can 
further be produced in plant cells, especially in transgenics 4!! 
plants. As example the heavy chain immunoglobulins can be 
produced in plants using the pMon530 plasmid (Roger et al. 
Melh Kn/ym 153 1566 1987) constitutive plant expression 
vector as has been described for classical four chain anti- 
bodies (Hiat et al. Nature 342 76-78, 1989) once again using 45 
the appropriate PCR primers as described above, to generate 
a DNA fragment in the right phase. 

Other advantages and characteristics of the invention will 
become apparent in the examples and figures which follow. 
FIGURES 

FIGS. 1(A), (15). and ((') Characterisation and purification 
of camel IgG by affinity chromatography on Protein A and 
Protein G sepharose (Pharmacia) 

FIG. (A) shows, after reduction, the SDS-PAGE protein 55 
profile of the adsorbed and non adsorbed fractions of Cam- 
clus dromedarius serum. The fraction adsorbed on Protein A 
and eluted with NaCl 0.f5M acetic acid 0.58% show upon 
reduction (lane c) three heavy chain components of respec- 
tively 50, 46 and 43 Kd and light chain (rabbit IgG in lane 60 
a). The fractions adsorbed on a Protein G Sepharose 
(Pharmacia) derivative which has been engineered to delete 
the albumin binding region (lane e) and eluted with O.fM gly 
HC1 pH 2.7 lacks the 46 Kd heavy chain which is recovered 
in the non adsorbed fraction (lane f). None of these com- 65 
ponents are present in the fraction non adsorbed on Protein 
A (lane d), lane b contains the molecular weight markers. 
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FIGS. (B) and (C) By differential elation, immunoglobu- 
lin fractions containing the 50 and 43 Kd heavy chain can be 
separated. 5 ml of C. dromadarius serum is adsorbed onto a 
5 ml Protein G sepharose column and the column is exten- 
sively washed with 20 mM phosphate buffer, pH 7.0. Upon 
elution with pH 3.5 buffer (O.f 5M NaCl, 0.58% acetic acid) 
a 100 Kd component is eluted which upon reduction yields 
a 43 Kd heavy chain, (lane f ). After column eluant absor- 
bance has fallen to background level a second immunoglo- 
bulin component of 170 Kd can be eluted with pH 2.7 buffer 
(0.1M glycine HC). This fraction upon reduction yields a 50 
Kd heavy chain and a board light chain band (lane 2). The 
fraction non adsorbed on Protein G is then brought on a 5 ml 
Protein A Sepharose column. After washing and elution with 
pH 3.5 buffer (0.1 5M NaCl. 0.589' acetic acid) a third 
immunoglobulin of 100 Kd is obtained which consists solely 
of 46 Kd heavy chains (lane 3). 

FIGS. 2(A) and 2(B) Immunoglobulins of Camelus 
bactriamts, Lama vicugna, Lama gluimi and Lama pa cos to 
Protein A (A lanes) and to Protein G (G lanes) analyzed on 
SDS-PAGE before FIG. (A) and after reduction FIG. (B) 

10 fil of serum obtained from the different species were 
added to Eppendorf® tubes containing 10 mg of Protein A 
or Protein G sepharose suspended in 400 fil of pH 8.3 
immunoprecipitation buffer (NaCl 0.2.M, Tris 0.01M; 
EDTA0.01M, Triton X100 1%, ovalbumin 0.1%). The tubes 
were slowly rotated for 2 hours al 1° ( '. After centrifugation 
the pellets were washed 3 times in buffer and once in buffer 
in which the Triton and ovalbumin had been ommitted. The 
pellets were then resuspended in the SDS-PAGE sample 
solution 70 fil per pellet with or without dithiotreitol as 
reductant. After boiling for 3 min at 100° C, the tubes were 
centrifuged and the supernatants analysed. In all species 
examined the unreduced fractions FIG. (A) contain in addi- 
tion to molecules of approximately 170 Kd also smaller 
major components of approximately 100 Kd. In the reduced 
sample FIG. (B) the constituant heavy and light chains are 
detected. In all species a heavy chain component (marked by 
an asterisk *) is present in the material eluted from the 
Protein A but absent in the material eluted from the Protein 
G. 

FIGS. 3(AHQ: IgG 1( IgG 2 and IgG 3 were prepared from 
serum obtained from healthy or Trypanosoma evansi 
infected Camelus dromedarius (CATT titer 1/160 (3) and 
analysed by radioimmunoprecipitation or Western Blotting 
for anti trypanosome activity 

FIG. (A) 35 S methionine labelled Trypanosome evansi 
antigens lysate (500.000 counts) was added to Eppendorf 
tubes containing 10 fi\ of serum or, 20 fig of IgG 1; IgG 2 or 
IgG 3 in 200 fil of pH 8.3 immunoprecipitation buffer con- 
taining 0.1M TLCK as proteinase inhibitor and slowly 
rotated at 4° C. during one hour. The tubes were then 
supplemented with 10 mg of Protein A Sepharose suspended 
in 200 fil of the same pH 8.3 buffer and incubated at 4° C. 
for an additional hour. 

After washing and centrifugation at 15000 rpm for f2 s, 
each pellet was resuspended in 75 fil SDS-PAGE sample 
solution containing DTT and heated for 3 min. at f 00° C. 
After centrifugation in an Eppendorf minifuge at 15000 rpm 
for 30 s, 5 al of the supernatant w as saved for radioactivity 
determination and the reminder analysed by SDS-PAGE and 
fluorography. The counts/5 fil sample are inscribed on for 
each fine. 

FIGS. (B) and (C) 20 fig of IgG 1; IgG 2 and IgG 3 from 
healthy and trypanosome infected animals were separated by 
SDS-PAGE without prior reduction or heating. The sepa- 
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rated samples were then electro transferred to a nitrocellu- 
lose membrane, one part of the membrane was stained with 
Ponceau Red to localise the protein material and the 
reminder incubated with 1% ovalbumin in TST buffer (Tris 
10 mM, NaCl 150 mM, Tween 0.05%) to block protein 
binding sites. 

After blocking, the membrane was extensively washed 
with TST buffer and incubated for 2 hours with 35 S-labelled 
trypanosome antigen. After extensive washing, the mem- 
brane was dried anil analysed by autoradiography. To avoid 
background and unspecific binding, the labelled trypano- 
some lysate was filtered through a 45// millipore filter and 
incubated with healthy camel iiimiunoglobulin and ovalbu- 
min adsorbed on a nitrocellulose membrane. 

FIGS. 4(A) and (B): Purified IgG3 of the camel, by 
affinity chromatography on Protein ASepharose are partially 
digested with papain and separated on Protein Asepharose. 

14 mg of purified IgG3 were dissolved in 0.1M phosphate 
buffer pH 7.0 containing 2mM EDI A. Yhey were digested 
by 1 hour incubation at 37° C. with mercurypapain (1% 
enzyme to protein ratio) activated by 5.f0 4 M cysteine. The 
digestion was blocked by the addition of excess iodoaceta- 
mide (4.10 2 M) (13). After centrifugation of the digest in an 
ependorf centrifuge for 5 min at f5000 rpm, the papain 
fragments were separated on a protein A Sepharose column 
into binding (B) and non binding (NB) fractions. The 
binding fraction was eluted from the column with 0.1M 
glycine HC1 buffer pH 1.7. 

FIG. 5: Schematic presentation of a model for IgG3 
molecules devoid of light chains. 

FIG. 6: Schematic representation of immunoglobulins 
having heavy polypeptide chains and devoid of light chains, 
regarding conventional four-chain model immunoglobulin. 
Representation of a hinge region. 

FIG. 7: Alignement of 17 V HH DNA sequences of Camel 
heavy chain immunoglobulins SEQ ID NOS:92-108 

FIGS. 8(A) and (B): Expression and purification of the 
camel V HH 21 protein from E. coli 

I HEAVY CHAIN ANTIBODIES IN CAMELIDS 

When Camelus dromedarius serum is adsorbed on Protein 
G sepharose, an appreciable amount (25-35%) of immuno- 
globulins (Ig) remains in solution which can then be recov- 
ered by affinity chromatography on Protein A sepharose 
(FIG. 1A). The fraction adsorbed on Protein G can be 
differentially eluted into a tightly bound traction (25%) 
consisting of molecules of an unreduced apparent molecular 
weight (MW) of 170 Kd and a more weakly bound fraction 
(30-45%) having an apparent molecular weight of 100 Kd 
(FIG. IB). The 170 Kd component when reduced yields 50 
Kd heavy chains and large 30 Kd light chains. The 100 Kd 
fraction is totally devoid of light chains and appears to be 
solely composed of heavy chains which after reduction have 
on apparent MW of 43 Kd (FIG. 1C). The fraction which 
does not bind to Protein G can be affinity purified and eluted 
from a Protein A column as a second 100 Kd component 
which after reduction appears to be composed solely of 46 
Kd heavy chains. 

The heavy chain immoglobulins devoid of light chains 
total up to 75% of the molecules binding to Protein A. 

As all three immunoglobulins bind to Protein A we refer 
to them as IgG: namely IgG , ( light cha in and heavy chain yl 
(50 Kd) binding to Protein G, IgG2 (heavy chain y2 (46 Kd) 
non binding to Protein G and IgG 3 (heavy chain y3 (43 Kd) 
binding to Protein G. There is a possibility that these three 
sub(classes) can he further subdivided. 
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A comparative study of old world camvlids (Camelus 
bactrianus and Camelus dromedarius) and new world cam- 
elids (Lama pacos, Lama glaniii, I., una nV»e/;<0 showed 
that heavy chain immunoglobulins are found in all species 

5 examined, albeit with minor differences in apparent molecu- 
lar weight and proportion. The new world camelids differs 
from the old world camelids in having a larger IgG 3 mol- 
ecule (heavy chain immunoglobulin binding to Protein G) in 
which the constituant heavy chains have an apparent 

10 molecular weight of 47 Kd (FIGS. 2A and B). 

The abundance of the heavy chain immunoglobulins in 
the serum of camelids raises the question of what their role 
is in the immune response and in particular whether they 
bear antigen binding specificity and if so how extensiv e is 

15 the repertoire. This question could be answered by exam- 
ining the immunoglobulins from Irvjhiiiosoiiui evansi 
infected camels (Camelus dromedarius). 

For this purpose, the corresponding fractions of IgG I , 
IgG 2 , IgG 3 were prepared from the serum of a healthy camel 

: " and from the serum ol camels with a high antitrypanosome 
titer, measured by the Card Agglutination Test (3). In radio- 
immunoprecipitation, IgGj, IgG 2 and IgG 3 derived from 
infected camel indicating extensive repertoire heterogeneity 
and complexity (FIG. 3A) were shown to bind a large 

25 number of antigens present in a 35 S methionine labelled 
trypanosome lysate. 

In blotting experiments 35 S methionine labelled trypano- 
some lysate binds to SDS PAGE separated IgGj, IgG 2 and 
IgG, obtained from infected animals (I Id. 315). 

30 This leads us to conclude that the camelid heavy chain 
IgG 2 and IgG 3 are bona fide antigen binding antibodies. 

An immunological paradigm states that an extensive 
antibody repertoire is generated by the combination of the 

35 light and heavy chain variable V region repertoires (6). The 
heavy chain immunoglobulins of the camel seem to contra- 
dict this paradigm. 

Immunoglobins are characterized by a complex I.E.F. 
(isoelectric focussing) pattern reflecting their extreme hcl- 

40 erogeneity. To determine whether the two heavy chains 
constituting the IgG, and IgG, are identical or not, the 
isoelectric focussing (I.E.F.) pattern were observed before 
and after chain separation by reduction and alkylation using 
iodoacetamide as alkylating agent. 

45 As this alkylating agent does not introduce additional 
charges in the molecule, the monomers resulting from the 
reduction and alkylation of a heavy chain homodimer will 
have practically the same isolectric point as the dimcr, 
whereas if they are derived from a heavy chain heterodimer, 

50 the monomers will in most cases differ sufficiently in iso- 
electric point to generate a different pattern in I.E.F. 

Upon reduction, and alkylation by iodoacetamide the 
observed pattern is not modified for the Camelus drom- 
edarius IgG. and IgG, indicating that these molecules are 

55 each composed of two identical heavy chains which migrate 
to the same position as the unreduced molecule they origi- 
nated from. 

In contrast, the I.E.F. pattern of IgGj is completely 
modified after reduction as the isoelectric point of each 
60 molecule is determined by the combination of the isoelectric 
points of the light and heavy chains which after separation 
will each migrate to a different position. 

These findings indicate that the heavy chains alone can 
generate an extensive repertoire and question the contribu- 
65 tion of the light chain to the useful antibody repertoire . If this 
necessity be negated, what other role does the light chain 
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Normally, isolated heavy chain from mammalian immu- 
noglobulins tend to aggregate considerably but are only 
soluhili/.ed by lighl chains (<S, '•)) winch bind lo the („1 
domain of the heavy chain. 

In humans and in mice a number of spontaneous or 
induced myelomas produce a pathological immunoglobulin 
solely composed of heavy chains (heavy chain disease). 
These myeloma protein heavy chains carry deletions in the 
C H 1 and V HH domains (10) . The reason why full lenght 
heavy chains do not give rise to secreted heavy chain in such 
pathological immunoglobulins seems to stem from the fact 
that the synthesis of Ig involves a chaperoning protein, the 
immunoglobulin heavy chain binding protein or HIP (11), 
which normally is replaced by the light chain (12). It is 
possible that the primordial role of the light chain in the 
four-cliain model immunoglobulins is that of a committed 
heavy chain chaperon and that the emergence of light chain 
repertoires has just been an evolutionary bonus. 

The camelid y2 and y3 chains are considerably shorter 
than the normal mammalian y chain. This would suggest that 
deletions have occurred in the C^l domain. Differences in 
sizes of the y2 and y3 immunoglobulins of old and new 
world camelids suggests that deletions occurred in several 
evolutionary steps especially in the C^l domain. 

II THE HEAVY CHAIN IMMUNOGLOBULINS 
OF THE CAMELIDS LACK THE C„l DOMAIN 

'Hie strategy followed for investigating the heavy chain 
immunoglobulin primary structure is a combination of pro- 
tein and cDNA sequencing; the protein sequencing is nec- 
essary to identify sequence streches characteristic of each 
immunoglobulin. The N-terminal of the immunoglobulin 
being derived from the heavy chain variable region reper- 
toire only yields information on the V HH subgroups 
(variable region of the heavy chain) and cannot be used for 
class or subclass identification. This means that sequence 
data had to be obtained from internal cn/ymalic or chemical 
cleavage sites. 

A combination of papain digestion and Protein A affinity 
chromatography allowed the separation of various frag- 
ments yielding information on the general structure of IgG3. 

The IgG3 of the camel (Camelus dronii darius) purified by 
affinity chromatography on Protein A Sepharose were par- 
tially digested with papain and the digest was separated on 
Protein A Sepharose into binding and non binding fractions. 
These fractions were analysed by SDS PAGE under reduc- 
ing and non reducing conditions (FIG 4A and B). 

The bound fraction contained two components, one of 28 
Kd and one of 14.4 Kd, in addition lo unclenved or partially 
cleaved material. They were well separated by gel electro- 
phoresis (from preparative 19% SDS-PAGE gels) under non 
reducing conditions and were further purified by electroelu- 
tion (in 50 nM amonium bicarbonate, 0.1% (w/v) SDS using 
a BioRad electro-eluter). After lyophilization of these elec- 
troeluted fractions, the remaining SDS was eliminated by 
precipitating the protein by the addition of 90% ethanol, 
mixing and incubating the mixture overnight at -20° C. (14). 
The precipitated protein was collected in a pellet by centri- 
fuging (15000 rpm, 5 min) and was used for protein 
sequencing. N-lerminal sequencing was performed using the 
automated Edman chemistry of an Applied Biosystem 477A 
pulsed liquid protein sequencer. Amino acids were identified 
as their phenylthiohydantoin (PTH) derivatives using an 
Applied Biosystem 120 PTH analyser. All chemical and 
reagents were purchased from Applied Biosystems. Analysis 
of the chromatographic data was performed using Applied 
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Biosystems software version 1.61. In every case the com- 
puter aided sequence analysis was cofirmed by direct inspec- 
tion of the chromatograms from the PTH analyser. Samples 
for protein sequencing were dissolved in cither 50' < (v v) 

5 trifluoroacetic acid (TEA) (28 Kd fragment) oi 100' ; TEA 
(14 Kd fragment). Samples of dissolved protein equivalent 
to 2000 pmol (28 Kd fragment) or 500 pmol (14 Kd 
fragment) were applied to TFA-treated glass fibre discs. The 
glass fibre discs were coated with BioBrenc (3ing) and 

1Q precycled once before use. 

N-terminal sequencing of the 28 Kd fragment yields a 
sequence homologous to the N-terminal part of y (',,2 
domain and hence to the N-terminal end of the Fc fragment. 
The N-terminal sequence of the 14.4 Kd fragment corre- 
sponds to the last lysine of a y C H 2 and the N-terminal end 

15 of a y C H 3 domain (Table 1). The molecular weight (MW) 
of the papain fragments and the identification of their 
N-terminal sequences led us to conclude that the C',,2 and 
C^3 domains of the y3 heavy chains are normal in size and 
that the deletion must occur either in the C^l or in the V HH 

20 domain to generate the shorted y3 chain. The fractions which 
do not bind to Protein A Sepharose contain two bands of 34 
and 17 Kd which are more diffuse is SDS PAGE indicating 
that they originate from the variable N-terminal part of the 
molecule (FIGS. 4A and B). 

25 Upon reduction, a single diffuse band of 17 Kd is found 
indicating that the 34 Kd is a disulfide bonded dinner of the 
17 Kd component. The 34 Kd fragment apparently contains 
the hinge and the N-terminal domain Y HH . 

30 The protein sequence data can be used to construct 
degenerate oligonucleotide primers allowing PC'R amplifi- 
cation of cDNA or genomic DNA. 

It has been shown that the cells from camel spleen imprint 
cells reacted with rabbit and anti camel immunoglobulin 

35 sera and that the spleen was hence a site of synthesis of at 
least one immunoglobulin class. cDNA was therefore syn- 
Ihelised from camel spleen mRNA. The condilions for the 
isolation of RNA were the following: total RNA was isolated 
from the dromedary spleen by the guanidium isothiocyanate 

40 method (15). mRNA was purified with oligo T-paramagnetic 
beads. 

cDNA synthesis is obtained using 1 //g mRNA template, 
an oligodT primer and reverse transcriptase 
(BOERHINGER MAN). Second strand cDNA is obtained 

45 using RNAse H and.E. coli DNA polymerase I according to 
the condition given by the supplier. 

Relevant sequences were amplified by PCR: 5 ng of 
cDNA was amplified by PCR in a 100 fA reaction mixture 
(10 mM Tris-HCl pH 8.3, 50 [M KC1, 15 mM MgCl 2 , 0.01% 

50 (w/v) gelatine, 200 /<M of each dNTP and 25 pmoles of each 
primer) overlaid with mineral oil (Sigma). Degenerate prim- 
ers containing EcoRI and Kpnl sites and further cloned into 
pUC 18. After a round of denaturing and annealing (94° C. 
for 5 min and 54° C. for 5 min), 2 units of Taq DNA 

55 polymerase were added to the reaction mixture before 
subjecting it to 35 cycles of amplification: 1 min at 94° C. 
(denature) 1 min at 54° C. (anneal), 2 min at 72° C. 
(elongate). To amplify DNA sequences between Y HH and 
C H 2 domains. (#72 clones), the PCR was performed in the 

60 same conditions with the exception that the annealing tem- 
perature was increased to 60° C. 

One clone examined (#56/36) had a sequence correspond- 
ing to the N-terminal part of a C ff 2 domain identical to the 
sequence of the 28 Kd fragment. The availability of this 

65 sequence data allowed the construction of an exact 3' primer 
and the cloning of the region between the N-terminal end of 
the V HH and the C H 2 domain. 



5,8^ 

23 

5' primers corresponding to the mouse V HH (16) and 
containing a Xhol restriction site were used in conjunction 
with the 3' primer in which a Kpnl site had been inserted and 
the amplified sequences were cloned into pBluescript®. 
Clone #56/36 which displayed two internal Haelll sites was 
digested with this enzyme to produce a probe to identify 
PCR positive clones. 

After amplification the PCR products were checked on a 
1.2% (w/v) agarose gel. Cleaning up of the PCR products 
included a phenol-chloroform extractio followed by further 
purification bv HPLC (GEN-PAC FAX column, Waters) and 
finally by using the MERMAID or GENECLEAN II kit, 
BIO 101, Inc.) as appropriate. After these purification steps, 
the amplified cDNAwas then digested with EcoRI and Kpnl 
for series #56 clones and with Xhol and Kpnl for series #72 
clones. A final phenol-chloroform extraction preceded the 
ligation into pUC 18(series #56 clones) or into pBluescript® 
(series #72 clones). 

All the clones obtained were smaller that the 860 base 
pairs In be expected if ihuy possessed ;i complcl V,, 7/ and 
C^l region. Partial sequence data corresponding to the 
N-terminal of the V HH region reveals that out of 20 clones, 
3 were identical and possibly not independent. The 
sequences obtained ressemble the human subgroup III and 
the murine subgroups Ilia and Illb (Table 2). 

Clones corresponding to two different sets of C2 protein 
sequences were obtained. A first set of sequences (#72/41) 
had a N-lcrminal ( ',,2 region identical lo the one obtained bv 
protein sequencing of the 28 Kd papain fragments of the y3 
heavy chain, a short hinge region containing 3 cysteines and 
a variable region corresponding to the framework (FR4) 
residues encoded by the J minigenes adjoining the hinge. 
The C^l domain is entirely lacking. This cDNA corresponds 
to the y3 chain (Table 4). 

In one closely related sequence (#72/1) the proline in 
position 259 is replaced by threonine. 

The sequence corresponding lo the ( ',,3 and the remaining 
part of the C^2 was obtained by PCR of the cDNA using as 
Kpnl primer a poly T in which a Kpnl restriction site had 
been inserted at the 5' end. The total sequence of the y3 chain 
corresponds to a molecular weight (MW) which is in good 
agreement with the data obtained from SDS PAGE electro- 

The sequence of this y3 chain presents similarities with 
oilier y chains excepl ihal il lacks ihe (',„ domain. Ihe V„„ 
domain being adjacent to the hinge. 

One or all three of the cysteines could be probably 
responsible for holding the two y3 chains together. 

These results have allowed us to define a model for the 
IgG3 molecule based on sequence and papain cleavage 
(FIG. 5). 

Papain can cleave the molecule on each side of the hinge 
disulfides and also between C H 2 and C H 3. Under non 
reducing conditions the Y HH domains of IgG3 can be 
isolated as disulfide linked dimer or as monomer depending 
on the site of papain cleavage. 

A second set of clones #72/29 had a slightly different 
sequence for the C,,2 and was characterized by a very long 
hinge immediately preceded by the variable domain. This 
hinge region has 3 cysteines at its C-terminal end in a 
sequence homologeous to the y3 hinge. Such second set of 
clones could represent the IgG2 subclass. For the constant 
part of the y3 and also for the putative y2, most clones are 
identical showing the y2 or y3 specific sequences. A few 
clones such as #72/1 however show minor differences. For 
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instance in the case of clones #72/1 two nucleotide differ- 
ences are detected. 

Several regions cDNA's have now been totally or 
partially sequenced with the exception of a short stretch at 

5 the N-terminal end which is primer derived. 

Upon translation the majority shows by Ihe characteristic 
heavy chain Ser 21 Cys 2 - and Tyr 90 Tyr 91 Cys 92 sequences, 
of the intra V HH region disulfide bridge linking residues 22 
and 92. All these clones have a sequence corresponding to 

l„ the framework 4 (FR4) residues of the variable region 
immediately preceding the postulated hinge sequence ( Table 
3). This sequence is generated by the J minigenes and is in 
the majority of cases similar to the sequence encoded by the 
human and murine minigenes. The sequence length between 
region Cys 92 and the C-terminal end of the V HH regions is 
variable and, in the sequences determined, range from 25 lo 
37 amino-acids as one might expect from the rearrange- 
ments of J and D minigenes varying in length. 

Several important questions are raised by the sole exist- 
ence of these heavy chain immunoglobulins in a non patho- 

;u logical situation, first of all. are they bonalkle antibodies? 
The heavy chain immunoglobulins obtained from trypano- 
some infected camels react with a large number of parasite 
antigens as shown in part I of these examples. This implies 
that the camelid immune system generates an extensive 

25 number of binding sites composed of single V HH domains. 
This is confirmed by the diversity of the V ' HH regions of the 
heavy chain immunoglobulins obtained by PCR. 

The second question is "how are they secreted?". The 
secretion of immunoglobulin heavy chains composing four- 

30 chain model immunoglobulins does not occur under normal 
conditions. A chaperoning protein, the heavy chain binding 
protein, or BIP protein, prevents heavy chains from being 
secreted. It is only when the light chain dispplaces the BIP 
protein in the endoplasmatic reticulum that secretion can 

35 occur (13). 

The heavy chain dimer found in the serum ol human or 
mice with the so-called "heavy chain disease" lack the C^l 
domains thought to harbour the BIP site (14). In the absence 
of thi domain the BIP protein can no longer bind and prevent 

40 the transport of the heavy chains. 

The presence in camels of a IgGl class composed of 
heav y and light chains making up between 25% and 50% of 
the total IgG molecules also raises the problem as to how 
maturation and class switching occurs and what the role of 

45 the light chain is. The camelid light chain appears unusually 
large and heterogeneous when examined in SDS PAGL. 

The largest dimension of an isolated domain is 40 A and 
the maximum attainable span between binding sites of a 
conventional IgG with C^l and V HH will be of the order of 

so 160 A (2V flH +2C ff l) (19). The deletion of C^l domain in 
the two types of heavy chain antibodies devoid of light 
chains, already sequenced has, as a result, a modification of 
this maximum span (FIG. 6). In the IgG3 the extreme 
distance between the extremities of the V HH regions will be 

55 of the order of 80 A (2W HH ). This could be a severe 
limitation for agglutinating or cross linking. In the IgG2 this 
is compensated by the extremely long stretch of hinge, 
composed of a 12-fold repeat of the sequence Pro-X (where 
X is Gin, Lys or Glu) and located N-terminal to the hinge 

60 disulfide bridges. In contrast, in the human IgG3, the very 
long hinge which also apparently arose as the result of 
sequence duplication does not contribute to increase the 
distance spanning Ihe two binding sites as this hinge is 
intcr-spcrscd with disulfide bridges. 

65 The single V HH domain could also probably allow con- 
siderably rotational freedom of the binding site versus the Fc 
domain. 
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Unlike myeloma liea\ y chains u bich result probably from 
C^l deletion in a single antibody producing cell, or heavy 
chain antibodies produced by expression cloning(15); the 
camelid heavy chain antibodies (devoid of iight chains) have 
emerged in a normal immunological environment and it is : 
expected that they will have undergone the selective refine- 
ment in specificity and affinity accompanying B cell matu- 
ration. 

Expression and Purification of the Camel V /; „21 i 
(DR21 on FIG. 7) Protein from E. coli 

The clones can be expressed in several types of expres- 
sion vectors. As an example using a commercially available 
vector Immuno PBS (Huse et al: Science (1989) 246, 1275), ] 
clones produced in Bluescript® according to the above 
described procedure, have been recovered by PCR using the 
same Xhol containing 5' primer and a new 3' primer, 
corresponding to residues 113-103 in the framework of the 
immunoglobulins, in which an Spe site has been con- 
structed: TC TTA ACT AGT GAG GAG ACG GTG ACC 7 
TG (SEQ ID NO:51). This procedure allowed the cloning of 
the y HH in the Xho/Spe site of the Immuno PBS vector. 
However, the 3' end of the gene was not in phase with the 
identification "tag" and the stop codon of the vector. To 
achieve this, the construct was cut with Spe and the 4 base 
overhangs were filled in, using the Klenow fragment after 
which the vector was religated. 

The expression vector plasmid ipBS (immunopBS) 
(Stratacyte) contains a pel B leader sequence which is , 
used for immunoglobulin chain expression in E. coli 
under the promotor pLAC control, a ribosome binding 
site, and stop codons. In addition, it contains a sequence 
for a c-terminal decapeptide tag. 
E. coli JM101 harboring the ipBS-V^ll plasmid was 
grown in 1 1 of TB medium with 100 ,wg/ml ampicillin 
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and 0.1 % glucose at 32° C. Expression was induced by 
the addition of 1 mM IPTG (final concentration) at an 
OD 550 of 1.0. After overnight induction at 28° C, the 
cells were harvested by centrifugation at 4.000 g for 10 
min (4° C.) and resuspended in 10 ml TES buffer (0.2M 
Tris-HCL pH 8.0, 0.5 mM EDTA, 0.5M sucrose). The 
suspension was kept on ice for 2 hours. Periplasmic 
proteins were removed by osmotic shock by addition of 
20 ml TES buffer diluted 1:4 v/v with water, kept on ice 
lor one hour and subsequently centrifugated at 12.0(10 
g for 30 min. at 4° C. The supernatant periplasmic 
fraction was dialysed against Tris-HCl pH 8.8, NaCl 50 
mM, applied on a fast Q Sepharose flow (Pharmacia) 
column, washed with the above buffer prior and eluted 
with a linear gradient of 50 mM to 1M NaCl in buffer. 
Fractions containing the V HH protein were further purified 
on a Superdex 75 column (Pharmacia) equilibrated with 
PBS buffer (0.01M phosphate pH 7.2, 0.15M NaCl). The 
yield of purified V HH protein varies from 2 to 5 mg/l cell 
culture. 

Fractions were analyzed by SDS-PAGE(I). Positive iden- 
tification of the camel V HH antibody fragment was done by 
Western Blot analysis using anlibodv raised ill rabbits 
against purified camel IgGH 3 and an anti-rabbit IgG-alkaline 
phosphatase conjugate (II). 

As protein standards (Pharmacia) periplasmic proteins 
prepared from 1 ml of IPTG-induced JMlOl/ipBS V HH 21 
were used. FIG. 8 shows: C,D:fractions from fast S 
Sepharose column chromatography (CEluted at 650 mM 
NaCl D:Eluted at 700 mM NaCl) Effractions from Super- 
dex 75 column chromatography. 

As can be seen, the major impurity is eliminated by 
ionexchange chromatography and the bulk of the remaining 
impurities are eliminated by gel filtration. 
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GGS VQTGGS LRLS CEI SGLTF D#72/4 SEQ ID NO:l 
GGSVQTGGSLRLSCAVSGFTFS #72/3 SEQ ID NO:2 
GGS EOGGGS LRLS CAI SGYTY G#72/7 SEQ ID NO:3 
GGS VQP GGS LTL S CTVS GATYS #72/17 SEQ ID NO:4 
GGS VQAGGS LRLS CTGS GF PYS #72/18 SEQ ID NO:5 

DVQLVAS GGGS VGAGGS LRL S CTAS GDS F S #72/2 SEQ ID NO:58 

EVKLVES GGGLVEP GGS LRLS CATS GF TF S Mouse V H III A SEQ ID NO: 58 

E VQLLS GGGLVQF GGS LRL S CAAS GF TF S Human V„III SEQ ID NO: 119 




G Q G T L 



G Q G T T 
G Q G T M 



G Q G T Q 

G Q G T Q 

G Q G T L 

G R G T Q 

G Q G T H 

G Q G I Q 



SEQ ID NO:9 

SKQ ID NO: 120 
SKQ ID NO: 12 1 
SKQ ID NO:122 
SKQ ID NO:9 
SKQ ID NO: 123 
SKQ ID NO: 124 



SEQ ID NO:8 
SKQ ID NO:S 
SKQ ID XO:'J 
SKQ ID XO:>9 
SKQ ID NO:(,0 
SliQ ID NO:(>: 



# 72/19 =# 72/3 

t 72 24 
#72/21 

# 72/16 



TABLE 4 



SEQ ID NO:125 

yvgss - - - - - - - y - - f - - - - - dvWGAGTTVTVSS MOUSE V H „,sequ. 

100 a b c d e f g h i j k - - - - 101 105 110 

I qpggycgygx - - - - - - - - - - cl WGQGTQVTVSS SEQ ID NO:13 

i 1 mdr i s q h - - - - - - - - - - - - gc RGQGTQVTVS L SEQ ID NO:14 

j e h 1 gpgai 1 dl kky - - - - - - k y WGQGTQVTVSS SEQ ID NO:15 

:ys t agdggs ge - - - - - - - - - my WGQGTQVTVSS SEQ ID NO:16 

I s ggs eel pi 1 f - - - - - - - - - d y WGQGTQVTVSS SEQ ID NO:17 

wkywt cgaqt ggyf - - - - - - - g q WGQGTQVTVSS SEQ ID NO:18 

I t emgacdar wat 1 at r t f a y n y WGQGTQVTVSS SEQ ID NO:19 Random sample 

ikdr t I waepr ew - - - - - - - - n n WGQGTQVTVS S SEQ ID NO:20 

i r fsspvgst si 1 e e - sdy - - ny WGQGTQVTVSS SEQ ID NO:21 

los i yys i 1 xi ey - - - - - - - - kyWOQOTQVTVSS SEQ ID NO:22 Different camel 

i pcympt mpappi I ds fgw - - d d F GQGTQVTVS S SEQ ID NO:23 region 

! s t ywyct t apy - - - - - - - - - n y WGQGTQVTVS S SEQ ID NO:24 

3i ewygcnl r t t f - - - - - - - - t r WGQGTQVTVSS SEQ ID NO:25 



5,840,526 

31 32 

TABLE 4-continued 

21 n q 1 aggwyl dpnywl svgay - - ai WGQGTQVTVS S SEQ ID NO:26 
24 r 1 t emgacdar w a t 1 at r t faynyWGROTQVTVSS SEQ ID NO:27 



25dgwt r keggi gl pws vqcedgynyWGQGTQVTVSSSEQll) NO:2S 
27 ds ypchl 1 - - - - - - - - - - - - - - d v WGRGTQVTVS S SEQ ID NO:29 

29 v e ypi admc s - r yGDPGTQVTVSS SEQ ID NO:30 

V J 

CDR3 

Human ^ mouse — size range 0-19 aa over 600 entries. 

camel 8-24 aa 18 entries 



EVQLVESGGG LVQPGGSLRL SCAASG SEQ ID NO:S0 
GO SVQGOGST.RT, SCAISG SEQ ID NO:84 
GG SVQAGGSLRL SCASSS SEQ ID NO:SS 



1GLEWVS SEQ ID N 



RITIS RDNSKNTI.YI. QMNSI RAEI ) 1'A VY YCAR SEQ II ) NO:S2 \ CDR3 ; WGQGTLVT VSS SEQ ID NO:83 
RFTIS QDSTLKTMYL LMNNLKPEDTGTY YCAA SEQ ID NO:86 i CDR3 ' WGQGTQVT VSS SEQ ID NO:91 
RFTIS QDSAKNTVYL QMNSLKPEDTAMY YCKI SEQ ID NO:90 ! CDR3 j WGQGTQVT VSS SEQ ID NO:87 



WGQGTQVT VS 

WGQGTQVT VSS f- EPKrPQPQPKPQl 



GTNEVCKCPKCP (\PELPGG PSVFVFP SEQIDNO:91 




>\J'J :HO<_._ _PSVOHI-r SI-« ]l.)NO£f. _ 




ill gamma I KVI )KK — 

human gamma 2 KVKVTV 
human gamma 4 KVDKRV 



J\PELLGG PSVFLFP SEQIDNO:12b 

}\PELLGG PSVFLFP SEQ ID NO:127 

krkccvecppcp appvag— psvflfp seq id NO: 

ESKYGPPCPSCP ^PEFLGG PSVFLFP SEQ ID NO:129 
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( D ) OTHER INFORMATION: ,'label=FRAMEWORK 1 
) SEQUENCE DESCRIPTION: SEQ ID NO:l: 



( i i ) MOLECULE TYPE: peptide 

( i x ) FEATURE: 

( A ) NAMK KKY: Hon,., 



) LENGTH: 22 amino acids 



( B ) LOCATION: 1..22 
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( i x ) FEATURE: 



) SEQUENCE DESCRIPTION: SEQ ID NO:17: 




( A ) NAMIi KHY: Don, 

( B ) LOCATION: 1..L7 



( B ) TYPE: amino acid 

( C ) STHANDEDNESS: single 

( D ) TOPOLOGY: linear 
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I SEQUENCE DESCRIPTION: SEQ 10 NO:: V 
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( x i 1 SEQUENCE DESCRIPTION: SEQ ID NO:33: 
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( i x ) FEATURE: 
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) SEQUENCE DESCRIPTION: SEQ ID NO:«: 
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( i ) SEQUENCE CHARACTERISTICS: 
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l : IM ORMATION FOR SEQ ID NO:51: 
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( : IM ORMATION FOR SEQ ID NO:57: 




( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:59: 




( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:6S: 
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( i x ) FEATURE: 



) FEATURE: 

( A ) NAME/KEY: Domain 
I H j I.OCA1IOX: l..:4 



( B ) TYPE: amino acid 

( (' ! STRANDKDNKSS: Mnplc 

( D , 1'01'OIOCY: linear 





( A ) NAME/KEY: Region 
( B ) LOCATION: 1..12 




( A ) NAME/KEY: Domain 

( B ) LOCATION: 1..12 

( D ) OTHER INFORMATION: .l:,bcl=('DR.^ 




( v i ) ORIGINAL SOURCE: 

( A ) ORGANISM: Cameras diomedarius 
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(ii) MOLECULE TYPE: protein 
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( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:88: 
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GAGTACACAG TATTTCTGCA AATGGACAAT 
CTGAAACCTG AGGACACGGC GATGTATTAC TGTAAAACAG CCCTACAACC TGGGGGTTAT 



2 ) INFORMATION FOR SEQ ID NO:94: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 443 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 



i , SI-WHNVl- 01 St'UIlTON: SHQ ID NO: H: 



TCAGGAAAGC AGCGTGAGGG GGTCGCAGCC ATTAATAGTG GCGGTGGTAG GACATACTAC 



GCGGCGGTCC CAGCCCACTT GGGACCTGGC GCCATTCTTG ATTTGAAAAA GTATAAGTAC 
TGGGGCCAGG GGACCCAGGT CACCGTCTCC TCACTAGCTA GTTACCCGTA CGACGTTCCG 



( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:Q5: 



CTCGAGTCTG GGGGAGGGTC GGTGCAGGCT GGAGGGTCTC TGAGACTCTC CTGTAATGTC 
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CGTGAGGGGG TCACAGCGAT TAACACTGAT GGCAGTATCA TATACGCAGC CGACTCCGTG 
AAGGGCCGAT TCACCATCTC CCAAGACACC GCCAAGGAAA CGGTACATCT CCAGATGAAC 



2 ) INFORMATION FOR SEQ ID NO:96: 

( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

i ) MOLECULE TYPE: cDNA 

i ) SEQUENCE DESCRIPTION: SEQ ID NO:96: 



AACTACTGGG GCCGGGGGAC CCAGGTCACC GTCTCCTCAC TAGCTAGTTA CCCGTACGAC 
GTTCCGGACT ACGGTTCTTA ATAGAATTC 

( 2 ) INFORMATION FOR SEQ ID NO:97: 



( A ) LENGTH: 424 base pair. 

( C ) STRANDEDM'ISS: .in.lt 
( D ) TOPOLOGY: linear 



i ) SHQI KMi; DKSCRIP1 "ON: SEQ ID NO:97: 



CACGGGATTC CGCTCAATGG TTACTACATC GCCTGGTTCC GTCAGGCTCC TGGGAAGGGG 
CGTGAGGGGG TCGCAACAAT TAATGGTGGT CGCGACGTCA CATACTACGC CGACTCCGTG 



i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 415 base pairs 

( ( , MR WDI DM >s mi, 1 
( D ., fOPOLOGY: linear 



i ) UOLfcCI IE 1TPE: cDNA 

i ) SEQUENCE DESCRIPTION: SEQ ID NO:98: 



) INFORMATION FOR SEQ ID NO:99: 



, MDAMIHW-* -in.* 



i . SI Ol I N. I 111 M Kll'l ON: M O ll> NO: '": 



AACCTGAAAC CTGAAGACAC GGGCACCTAT TACTGTGCTG CAGAACTAAG TGGTGGTAGT 
TGTGAATTGC CTTTGCTATT TGACTACTGG GGCCAGGGCA CCCAGGTCAC CGTCTCCTCA 



2 ) INFORMATION FOR SEQ ID NO: 100: 



D ) TOPOLOGY: linear 



V: SEQ ID NO:100: 

CTCGAGTCTG GGGGAGGCTC GGTGCAGGCT GGAGGGTCTC TGAGACTCTC CTGTACAGGC 
TCTGGATTCC CCTATAGTAC CTTCTGTCTG GGGTGGTTCC GCCAGGCTCC AGGGAAGGAG 



AGAATTC 

I : i INFORMATION FOR SEQ ID NO:101: 

( i ) SEQUENCE CHARACTERISTICS: 
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i : INFORMATION FOR SEQ ID XO:104: 
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i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 433 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNkSS: single 



i i KkQl KM Di:S(. RIl'l.O.V SkQ ID M):l')-: 



GGTGGCTGGT ATTTGGACCC GAATTACTGG CTCTCTGTGG GTGCATATGC CATCTGGGGC 
CAGGGGACCC AGGTCACCGT CTCCTCACTA GTTACCCGTA CGACGTTCCG GACTACGGTT 



2 ) INFORMATION FOR SEQ ID NO:105: 

( i ) SEQUENCE CHARACTERISTICS: 

single 



( i i ) MOLECULE TYPE: cDNA 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:105: 
CAGGTGAAAC TGCTCGAGTC TGGGGGAGGC TCGGTGCAGG CTGGGGGGTC TCTGACACTC 
TCTTGTGTAT ACACCAACGA TACTGGGACC ATGGGATGGT TTCGCCAGGC TCCAGGGAAA 



2 ) INFORMATION FOR SEQ ID NO: 106: 



(A,l MM 1 I'll: -1,1 l, a , c ,,;,!,„ 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNHSS: sinplc 
( D ) TOPOLOGY: linear 



( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:106: 
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( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:108: 

CTCGAGTCAG GTGTCCGGTC TGATGTGCAG CTGGTGGCGT CTGGGGGAGG CTCGGTGCAG 60 

GCTGGAGGCT CTCTGAGACT CTCCTGTACA GCCTCTGGAG ACAGTTTCAG TAGATTTGCC 120 

ATGTCTTGGT TCCGCCAGGC TCCAGGGAAG GAGTGCGAAT TGGTCTCAAG CATTCAAAGT 18 0 

AATGGAAGGA CAACTGAGGC CGATTCCGTG CAAGGCCGAT TCACCATCTC CCGAGACAAT 240 

TCCAGGAACA CAGTGTATCT GCAAATGAAC AGCCTGAAAC CCGAGGACAC GGCCGTGTAT 300 

TACTGTGGGG CAGTCTCCCT AATGGACCGA ATTTCCCAAC ATGGGTGCCG GGGCCAGOGA 360 



) INFORMATION FOR SEQ ID NO:109: 

( i ) SEQUENCE CHARACTERISTICS 
( A ) LENGTH: 18 amino ai 
( F! ) TYPE: amino acid 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: peptide 

( x i ) SEQUENCE DESCRIPTION: SEQ 



)R SEQ ID NO:110: 

i ) SEQUENCE CHARACTERISTICS: 

( A ) LENGTH: 18 amino acids 

i II i rOPOl OGY: lincai 



( i i ) MOLECULE TYPE: peptide 
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( 2 ) INFORMATION FOR SEQ ID NO:121: 

( i ) SEQUENCE CHARACTERISTICS: 




( i i ) MOLECULE TYPE: peptide 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:121: 



( A ) LENGTH: 11 an 
( B ) TYPE: amino ac 
( D ) TOPOLOGY: lii 



( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 11 amino acids 




( i i ) MOLECULE TYPE: peptide 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:123: 




( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:124: 



2 ) INFORMATION FOR SEQ ID NO: 126: 

( i ) SKQIIKVk t'HARAi I ■R.Sl'K S: 

( A ) LENGTH: 67 amino adds 
( B ) TYPE: amino acid 



i , SI-.QI I' HI M Rll'l OV SH> 111 \0:l>: 



( 2 ) INFORMATION FOR SliO in NO:!:": 

( i ) SEQUENCE CHARACTERISTICS: 
( B ) TYPE: amino acid 



) MOl K'l I li ITPli: pcp'iJc 



2 ) INFORMATION FOR SEQ IP NO: I :s: 

( i ) SEQUENCE CHARACTERISTICS: 



, P , I'OI'OIOGY: lineai 
i ) MOLECULE TYPE: peptide 
i ) SEQUENCE DESCRIPTION: SEQ ID NO:128: 
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( 2 ) INFORMATION FOR SEO ID NO:130: 




We claim: 

1. An immunoglobulin comprising l\vo heavy polypeptide 
chains, each heavy chain consisting of a complete antigen 
binding site, said immunoglobulin containing a variable 
(V /7// ) region and a constant region, said constant region 
being devoid of first constant domain C H 1, wherein the 
immunoglobulin is devoid of polypeptide light chains, and 
wherein the variable region contains in position 45 an amino 
acid which is not a leucine, proline or glutamine residue. 

2. An immunoglobulin comprising two heavy polypeptide 
chains, each heavy chain consisting of a complete antigen 
binding sile. said immunoglobulin containing a variable 
(Shh) region and a constant region, said constant region 
being devoid of first constant domain C H 1, wherein the 
immunoglobulin is devoid of polypeptide light chains, and 
comprising: 



lour frameworks in ils variable region, which frameworks 
comprise an amino-acid sequence selected from the 
following sequences: 

for the framework f domain 



GGSVQTGGSLRLSCEISGLTFD (SEQ ID NO:l) 
G G S V Q T G G S L R L S C A V S G F S F S (SEQ ID NO:2) 
GGSEQGGGSLRLSCAISGYTYG (SEQ ID NO:3) 
GGSVQPGGSLTLSCTVSGATYS (SEQ ID NO:4) 
GGSVQAGGSLRLSCTGSGFPYS (SEQ ID NO:5) 
GGSVQAGGSLRL SCVAGFGTS (SEQ ID NO:6) 
G G S V Q A G GSLRLSCVSFSP S S (SEQ ID NO:7) 



for the framework 4 domain 



WGQGTQVTVSS (SEQ ID NO:8) 
WGQGTLVTVSS (SEQ ID NO:9) 
WGQGAQVTVSS (SEQ ID NO:10) 
WGQGTQVTASS (SEQ ID NO:ll) 
40 RQQGTQVTVSL (SEQ ID NO:12) 



and/or, in that its variable region comprises CDR 
domains, comprising for the CDR3 domain 



ALQPGGYCGYGX CL (SEQ ID NO:62) 

VSLMDRISQH GC (SEQ ID NO:63) 

VPAHLGPGAILDLKKY KY (SEQ ID NO:64) 

FCYSTAGDGGSGE MY (SEQ ID NO:65) 

ELSGGSCELPLLF DY (SEQ ID NO:66) 

DWKYWTCGAQTGGYF G Q (SEQ ID NO:<>7) 

RLTEMGACDARWATLATRTFAYNY (SEQ ID NO:68) 

QKKDRTRWAEPREW N N (SEQ II) VO:«] 

GSRFSSPVGSTSRLES-SDY--NY (SEQ ID NO:70) 

ADPSIYYSILXIBY KY (SEQ ID NO:71) 

DSPCYMPTMPAPPIRDSFGW--DD (SEQ ID NO:72) 

55 TS S FYWYCTTAPY N V (SEQ ID NO:73) 

TEIEWYGCNLRTTF TR (SEQ ID NO:74) 

NQLAGGWYLDPNYWLSVGAY--AI (SEQ ID NO:75) 
RLTEMGACDARWATLATRTFAYNY (SEQ ID NO:76) 
DGWTRKBGGIGLPWSVQCBDGYNY (SEQ ID 
NO:77) 

60 DSYPCHLL D V (SEQ ID VO:7S) 

VBYPIADMCS RY (SEQ ID NO:79) 



and/or, 

65 in that its constant region comprises C H 2 and C H 3 
domains comprising an amino acid sequence selected 
from the following sequences: 
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for the C H 2 domain: 



APF.T ,T IrGPTVFIFPPKPKD VI ,SrTT TP (SEQ ID NO:31) 
APELPCtCtPSVFVFPTKPKDVLSISGRP (SEQ ID NO:32) 
APELPGGPSVFVFPPKPKDVLSISGRP (SEQ ID NO:33) 
APELLGGPSVFIFPPKPKDVLSISGRP (SEQ ID NO:34) 



for the C H 3 domain: 



GQTREPQVYTLA (SEQ ID NO:35) 
GQTREPQVYTLAPXRLEL (SEQ ID NO:36) 
GQPREPQVYTLPPSRDEL (SEQ ID NO: 109) 
GQPREPQVYTLPPSREEM (SEQ ID NOrllO) 
GQPREPQVYTLPPSQEEM (SEQ ID NO:lll) 



and/or, 

in that its hinge region comprises from 0 to 50 amino 

3. The immunoglobulin of claim 2, wherein said hinge 
region comprises a polypeptide having an amino acid 
sequence selected from the following sequences: 



GTNEVCKCPKCP (SEQ ID NO:37) 
' EPKIPQPQPKPQPQPQPQPKPQPKPEPECTCKCP (SEQ ID NO:38) 



4. An immunoglobulin comprising two heavy polypeptide 
chains, each heavy chain consisting of a complete antigen 
binding site, said immunoglobulin containing a variable 
(V^^) region and a constant region, said constant region 
being devoid of first constant domain C H 1, wherein the 
immunoglobulin is devoid of polypeptide light chains, and 
wherein said immunoglobulin is of class 2 (IgG2) or class 3 
(IgG3) and is obtained by a process comprising cloning 
DNAor cDNA sequences encoding an immunoglobulin or a 
V HH domain having a determined specific antigen binding 
site into the pMM984 plasmid and transfecting cells with the 
recombinant plasmid. 

5. An immunoglobulin comprising two heavy polypeptide 
chains, each heavy chain consisting of a complete antigen 
binding site, said immunoglobulin containing a variable 
(V HH ) region and a constant region, said constant region 
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being devoid of first constant domain C^l, wherein the 
immunoglobulin is devoid of polypeptide light chains, and 
wherein said immunoglobulin is of class 2 (IgG2) or class 3 
(IgG3) and is obtained by a process comprising cloning 

5 DNAor cDNA sequences encoding an immunoglobulin or a 
V HH domain having a determined specific antigen binding 
site into a vector wherein the vector is a vector appropriate 
for expression in plant cells, and the transformed recombi- 
nant cells are plant cells. 

10 6. The immunoglobulin of claim 5, wherein said vector is 
pMon530. 

7. An immunoglobulin comprising two heavy polypeptide 
chains, each heavy chain consisting of a complete antigen 

1S binding site, said immunoglobulin containing a variable 
(Shh) region and a constant region, said constant region 
being devoid of first constant domain C H 1, wherein the 
immunoglobulin is devoid of polypeptide light chains, and 
wherein said immunoglobulin is directed against a biologj- 

20 cal organism selected from the group consisting of a 
bacteria, a virus and a parasite. 

8. An immunoglobulin comprising two heavy polypeptide 
chains, each heavy chain consisting of a complete antigen 
binding site, said immunoglobulin containing a variable 

25 (Shh) region and a constant region, said constant region 
being devoid of first constant domain C^l, wherein the 
immunoglobulin is devoid of polypeptide light chains, and 
wherein said immunoglobulin is directed against a biologi- 
cal molecule. 

30 9. The immunoglobulin of claim H wherein said biological 
molecule is selected from the group consisting of a protein, 
a hapten, a carbohydrate, a nucleic acid, a cellular receptor, 
and a membrane protein. 

10. An immunoglobulin comprising two heavy polypep- 

35 tide chains, each heavy chain consisting of a complete 
antigen binding site, said immunoglobulin containing a 
variable (V NH ) region and a constant region, said constant 
region being devoid of first constant domain C^l, wherein 
the immunoglobulin is devoid of polypeptide light chains, 

40 and wherein said immunoglobulin is conjugated with a 
toxin. 



